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1  Introduction 

1.1  Problem  Description 


The  motivation  for  this  research  arises  from  the  maneuvering  and  target  tracking  func¬ 
tions  currently  envisioned  for  large  space  structure  (LSS)  optical  tracking  systems.  Such 
systems  consist  of  lightweight  structures  which  are  mounted  on,  or  articulated  with  respect 
to,  heavier  less  flexible  structures.  Within  the  lightweight  structure  is  an  optical  system, 
and  within  that  system  there  are  subsystems  having  similar  further  divisions.  For  example, 
a  typical  LSS  optical  system  might  consist  of  the  following  subsystems,  listed  here  in  order 
of  increasing  bandwidth:  (1)  coarse  tracking  of  rigid  body  dynamics  via  gimbals,  CMG’s, 
reaction  jets,  and  isolation  devices;  (2)  beam  pointing  via  mirror  actuators;  (3)  wavefront 
and  focus  via  secondary  mirror  control;  and  (4)  fine  tracking  via  fast  mirrors  and  track  illu¬ 
mination.  Due  to  the  natural  frequency  separation  between  each  descending  subsystem,  it 
is  possible  to  view  each  as  essentially  an  independent  tracking  problem. 

With  this  vie.w  in  mind  our  recLsearch  has  concentrated  on  a  more  generic  and  fundamental 
objective,  namely,  feedback  control  for  rapid  precision  maneuvering  of  flexible  structures. 
This  objective  encompasses  any  or  all  of  the  typical  subsystems  involved  in  an  LSS  optical 
tracking  system.  Our  specific  aims  were  as  follows: 

(1)  Determine  the  performance  limitations  for  rapid  slewing. 

(2)  Develop  methodologies  to  implement  feedback  slewing  controllers  which  achieve  sjreci- 

fied  performance  goals. 


1.2  Overview  of  Results 

The  limitations  of  tracking  performance,  particularly  target  acquisition,  arise  irrincipally 
from  actuator  saturation.  Since  saturation  is  a  nonlinear  function,  it  is  not  surprising  that 
high  performance  slewing  controllers  are  nonlinear.  The  rapid  slowing  prohlcm,  wliich  is 
perhaps  more  well  known  as  the  time-optimal  control  problem,  has  been  the  subject  of 
research  for  very  many  years.  Mathematically,  a  complete  formulation  of  this  predflem  can  be 
obtained  using  Pontryagiri’s  Maximum  Principle,  c.g.,  [Athl],  However,  in  only  the  simplest 
cases  has  it  been  possible  to  obtain  a  solution.  The  general  solution  recpiires  solving  a  two- 
point  boundary  value  problem.  In  addition,  for  practical  reasons,  such  as  roluistiie.ss  and 
disturbance  attenuation,  it  is  necessary  to  implement  tlie  time-optimal  control  as  a  feedback 
system.  However,  such  an  implementation  requires  determining  very  eoniplicated  multi¬ 
dimensional  switching  surfaces.  This  is  not  a  solved  problem  except  for  some  very  sim])le 
cases,  e.g.,  rigid  body  dynamics.  As  a  result,  the  research  challenge  undertaken  hen‘  is  the 
design  and  feedback  implementation  of  time-optimal  control  for  systems  with  the  complexity 
of  tyjrical  LSS  o;  ;al  tracking  systems. 

Within  the  s'  ■  ;  of  linear  (time-invariant  fiiiite-dimeiisioiial)  coiitrcd  design  and  analysis, 
there  exists  a  Iderablc  amount  of  theory  as  well  as  com])lem('ntaiy,  very  edicF'iit  eoin- 
putatioiial  tools.  Many  chissos  of  performance  specilications  can  be  included  in  the  design 


1 


methodology,  e.g.,  minimum  energy,  worst-case  gain,  etc.  Nonlinear  control  design,  however, 
is  at  its  infancy,  and  except  for  very  specific  cases,  a  performance  specification  other  than 
stability  is  very  hard  to  incorporate. 

The  minimum-time  control  design  problem  is  a  nonlinear  control  design  problem.  This 
is  due  to  the  fact  that  the  solution  relies  on  maximizing  the  available  actuation  energy  over 
the  finite  duration  of  actuation.  As  is  well  known  from  Pontryagin’s  Maximum  Principle,  for 
a  given  reference  the  actuation  is  bang-bang,  i.e.,  saturating  at  the  two  extremes.  Clearly 
a  scalar  multiple  of  the  reference  will  yield  a  similar  actuation  sequence  over  a  different 
duration.  Observe  that  a  closed-loop  implementation  is  acceptable  provided  that  it  generates 
the  (open-loop)  time-optimal  solution  at  the  plant  input. 

For  a  given  plant  model  and  reference  trajectory,  if  one  could  solve  the  associated 
minimum-time  problem,  one  would  obtain  the  description  of  the  input  in  terms  of  the 
costate.  This  costate-to-input  map  would  in  turn  describe  a  closed-loop  implementation. 
Such  a  point  of  view,  apart  from  the  nontrivial  issue  of  solving  the  minimum-time  problem, 
has  two  other  major  drawbacks: 


i)  A  time-optimal  maneuver  requires  an  actuation  sequence  which  does  not  excite  the 
structural  modes.  Since  the  structural  modes  can  only  be  determined  within  some 
tolerance,  what  is  optimal  for  the  design  model  may  not  prove  to  be  satisfactory  for 
the  perturbations  of  the  design  model.  In  other  words,  the  complete  solution  lacks 
information  on  the  achieved  robustness  in  performance. 

ii)  Regardless  of  the  plant  model,  the  solution  is  necessarily  bang-bang.  Hence,  one  re¬ 
quires  switching  logic  (sign  functions)  to  implement  the  costate-to-input  map.  Such 
an  implementation  is  known  to  chatter  for  small  error  signals. 


These  drawbacks  imply  that  one  should  step  back  from  the  minimum-time  and  seek 
api^roximate  solutions  which  will  make  the  clo.sed-loop  im])lementation  feasible  and  achieve 
performance  degradation  which  is  acceptable.  Within  the  scope  of  this  project,  two  distinct 
ajjproaches  have  been  taken  in  the  course  of  seeking  an  approximate  solution: 


•  Aj)]Moach  1  . 

Start  with  a  representative  problem  for  which  the  complete  analytical  solution  can  be 
obtained.  Modify  the  solution  to  obtain  an  iniplementablc  fast  tracking  closed-looj) 
system.  Such  an  approach  restricts  the  set  of  allowable  models.  However,  the  existence 
of  an  analytical  solution  provides  a  variety  of  extensions  to  be  made,  e.g.,  adaptation. 
Section  2  focuses  on  the  results  obtained  using  this  paiticuiar  approach. 

•  A])proach  2  . 

Instead  of  modifying  the  solution  to  a  sj)ecific  jjroblem,  pose  the  original  minimum- 
time  jjroblem  as  the  limit  of  a  sequence  of  easy  to  solve  convex  optimization  j)roblems. 
l‘br  a  fixed  member  of  this  sequence,  determine  a  closed-loop  design  scheiiK;  based 
on  the  .solution.  Such  an  apj)roach  increa.ses  computational  complexity  in  the  design 


stage.  However,  for  a  satisfactory  approximation  to  the  original  minimum-time  prob¬ 
lem,  the  closed-loop  design  makes  intensive  use  of  existing  linear  (time-invariant  finite¬ 
dimensional)  design  tools.  The  closed-loop  design  relies  on  a  model  follower  scheme. 
Section  3  focuses  on  the  results  obtained  using  this  particular  approach. 
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2  Approach  1 

This  phase  of  the  project  consists  of  the  research  leading  to  the  following  three  publications 
which  are  reprinted  in  Appendices  B  through  D  .  Part  of  this  research  was  motivated  by  the 
publication  in  Appendix  A  (under  an  NSF  grant),  which  is  included  for  completeness. 

1.  M.  L.  Workman,  R.  L.  Kosut  and  G.  F.  Franklin, 

“Adaptive  Proximate  Time-Optimal  Control:  Discrete-Time  Case,” 

Proceedings  of  the  IEEE  Conference  on  Decision  and  Control,  pp.  1548-1553, 

Los  Angeles,  California,  December  1987. 

2.  R.  L.  Kosut  ,  A.  M.  Pascoal,  M.  L.  Workman  and  G.  F.  Franklin, 

“Minimum-Time  Control  of  Large  Space  Structures,” 

Proceedings  of  the  SPIE  Conference, 

Los  Angeles,  California,  January  1988. 

3.  A.  M.  Pascoal,  R.  L.  Kosut,  G.  F.  Franklin,  D.  R.  Meldrum  and  M.  L.  Workman, 
“Adaptive  Time-Optimal  Control  of  Flexible  Structures,” 

Proceedings  of  the  American  Control  Conference, 

Pittsburgh,  Pennsylvania,  June  1989. 

During  this  period  Dr.  Kosut  collaborated  with  Prof.  Franklin  from  the  EE  Dept,  at 
Stanford  University  and  hi.s  graduate  students  M.L.  Workman  and  D.R.  Meldrum.  Dr.  Pas¬ 
coal  was  employed  by  ISI. 
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3  Approach  2 

3.1  Introduction^ 

Within  the  setting  of  finite-dimensional  linear  time-invariant  multi-input  multi-output  plant 
modeling/identification  and  control  design,  increased  computational  power  allows  higher 
order  nominal  plant  models  to  be  incorparated  in  the  designs  in  order  to  meet  stringent 
performance  demands.  Currently  used  design  methodologies  include  minimum-energy,  worst- 
case  energy-gain  and  peak-gain  designs.  In  the  case  of  actuator  saturation,  even  if  a  globally 
stabilizing  compensator  is  designed,  saturation  of  the  actuators  may  significantly  degrade 
the  overall  performance.  Typically,  one  forces  sufficient  conditions  to  guarantee  that  the 
actuator  signals  do  not  saturate  over  the  region  of  operation. 

The  minimum-time  tracking  (of  a  fixed  or  moving  target)  on  the  other  hand  becomes 
an  ill-posed  problem  if  the  saturation  of  the  actuators  is  not  incorporated  in  the  problem 
description.  Subject  to  the  specified  saturation  limits,  minimum-time  solutions  are  neces¬ 
sarily  bang-bang;  hence,  a  linear  control  design  will  prove  to  be  unsatisfactory.  Closing  the 
loop  aside,  solving  for  the  (or  a,  as  the  case  might  be)  time-optimal  input  is  a  nontrivial 
task.  Pontryagin’s  Maximum  Principle  (see  e.g.,  [Athl])  brings  a  complete  solution  to  the 
problem,  using  the  calculus  of  variations.  Solving  for  the  equivalent  nonlinear  (non-convex) 
non-differcntiable  programming  problem  brings  up  all  the  associated  hardships.  In  order  for 
such  an  approach  to  be  implemcntablc,  one  requires  a  characterization  of  the  switching  sur¬ 
faces  in  the  state-space.  Complete  solutions  for  single-input  single-output  low  order  models 
have  been  derived  in  the  literature;  these  results  arc  extremely  case  specific  and  they  are 
derivations  for  the  complete  solution  for  the  case  at  hand.  Regardless  of  what  the  solution  is, 
the  result  requires  a  relay  in  the  closed-loop;  hence,  chattering  subject  to  disturbances.  The 
way  to  overcome  this  difficulty  is  to  approximate  the  infinite-gain  nonlinearity  (namely,  the 
relay)  with  a  finite-gain  nonlinearity.  Hence  one  is  bound  to  step  back  from  the  time-optimal 
result  for  the  sake  of  implementation.  Even  if  one  reformulates  the  open-loop  minimum-time 
problem  as  a  fast  finite-time  tracking  problem  (for  a  given  reference  trajectory,  determine  an 
input  (subject  to  actuator  saturation)  so  that  the  tracking  error  remains  at  zero  after  a  finite 
time-instant)  a  feasible  solution  method  for  a  clo.scd-looj)  design  goal  remains  a  challenge. 

Fast  finite-time  tracking  of  reference  signals  with  saturating  actuators  has  been  a  bencli- 
mark  control  problem  since  the  complete  solution  method  to  the  open-looj)  problem  has  been 
derived  using  Pontryagiii’s  Maximum  Principle.  Solutions  for  the  rigid  body  approximations 
have  been  used  in  different  control  strategies  in  applications  areas  such  as  the  control  of  disk 
drives  and  precise  maneuvering  of  flexible  systems.  The  demand  for  fast  tracking  with  satu¬ 
rating  actuators  has  produced  a  variety  of  closed-loop  implementations  ranging  from  using 
idealized  relays,  finite-gain  relays,  adaptation  methods,  etc.  [WoiT,  Wor2,  Word]  .  In  the 
meantime,  openloop  .solution  to  the  problem  with  flexible  modes  has  been  studied  in  detail, 
for  specific  cases  with  no  damping.  The  resulting  proposed  nonlinear  optimization  problems 
are  derived  using  the  necessary  optimality  conditions  posed  by  the  Maximum  Principle,  for 
specific  plant  models  [Beni,  Sinl]  . 

In  this  report,  an  approximation  to  the  time-optimal  tracking  problem  under  actuator 

*An  abridged  version  of  a  .subset  of  tlie  following  results  is  to  appear  in  the  Proceedings  of  the  I  PEP 
Conference  on  Decision  and  Control,  Brighton,  United  Kingdom,  Decf'iiiber  1991  (see  Apix-ndix  M)  . 
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saturation  constraints  is  proposed  for  linear  time-invariant  finite-dimensional  multi-input 
multi-output  plants.  The  approximation  relies  on  a  transfer  function  approach  to  formulating 
constraints  on  the  set  of  admissible  finite-duration  signals  that  achieve  precise  point-to- 
point  positioning  of  flexible  structures  [Bhal]  .  The  results  arc  used  to  design  fast  finite¬ 
time  tracking  closed-loop  feedback  systems  using  only  linear  time-invariant  design  tools; 
the  design  replicates  the  open-loop  behaviour  with  the  added  advantages  of  feedback.  An 
example  illustrates  the  feasibility  of  the  approach. 

3,2  Motivation 

Consider  a  linear  time-invariant  finite-dimensional  multi-input  multi-output  plant  P  .  For 
preliminary  arguments,  consider  the  continuous-time  case  (the  solution  method  proposed 
relies  on  transfer  function  descriptions;  hence  it  naturally  extends  to  the  discrete- time  case). 
Consider  the  following  optimal  control  problem: 

•  For  a  given  reference  r  ,  find  the  minimum  time  T  such  that  a  7'-duration  control  input 
u  (Halloo  ^  1)  achieves  {P  *  u){t)  =  r{t)  for  alH  >  T  . 

Clearly,  not  all  signals  r  can  be  tracked  in  such  manner.  Once  the  right  class  of  r  is  described, 
this  problem  boils  down  to  the  standard  time-optimal  intercept  problem.  As  is  well  known, 
Pontryagin’s  Maximum  Principle  sets  up  the  necessary  conditions  to  solve  this  problem; 
however,  literally  computing  the  control  u  turns  out  to  be  a  nasty  nonlinear  optimization 
problem.  Even  if  we  had  the  the  time-optimal  Ur  that  tracked  r  in  Tr  seconds,  trying  this 
input  on  an  open-loop  plant  is  a  futile  attempt.  Consider  a  simple  double  integrator  plant 
with  r  as  a  step  input,  The  time-optimal  Ur  clearly  satisfies  Uj.{r)dT  =  0  ;  otherwise 
the  plant  would  start  to  tumble,  If  one  had  an  a-ctuation  disturbance  (say  a  short  duration 
pulse  with  a  DC  component)  there  would  be  no  way  to  implement  a  time-optimal  slew  in 
open-loop  without  exact  knowledge  of  the  input  disturbance.  Clearly,  one  requires  a  feedback 
configuration. 

For  the  double  integrator  plant,  for  any  stabilizing  compensator  (in  the  unity-feedback 
system)  ,  the  sensitivity  map  has  a  zero  at  s  =  0  ;  hence,  any  additive  DC  disturbance  at  the 
input  is  asymptotically  rejected  at  the  true  plant  input. Instead  of  choosing  any  stcibilizing 
compensator,  if  we  designed  an  -optimal  compensator,  we  will  have  a  worst-case  amplitude 
gain  from  the  input  disturbance  to  the  plant  input.  Using  this  headroom,  we  could  solve  the 
time-optimal  problem  with  the  actuation  constraint  modified  to  ||u||oo  <  (1  ~  £)  i  where 
E  G  (0,1)  ,  is  the  discount  factor  keeping  the  worst  case  disturbance  contribution.  In  other 
words,  the  closed-loop  design  (how  sophisticated  one  desires  to  get)  can  be  decoupled  from 
the  time-optimal  jjroblem  by  presetting  e  .  Clearly,  any  stable  unity-feedback  system  can  be 
used  to  track  r  once  the  time-optimal  (or  as  close  as  one  can  get)  Ur  is  determined  (inject  iir 
at  the  plant  inj)ut  and  use  Pur  as  the  reference  and  let  the  compensator  work  on  the  output 
measurement  error);  see  Figure  1  .  This  line  of  reasoning  works  fine  as  long  as  one  has  the 
suitable  Mr  i  ui  otlier  words,  tlie  open-loop  solution. 

So,  a  feasible  closed-loop  (robust,  disturbance  rejecting,  ...)  design  problem  boils  down  to 
computing  n^  in  a  fast  manner;  possibly,  after  certain  simplifications  and  getting  sul)-o])tinia.l 
results  within  a  reasonable  headroom. 
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Figure  1:  Stable  unity-feedback  system  with  zero  initial  conditions  that  achieves  y  =  Pur 


This  report  focuses  on  an  approximation  to  the  time-optimal  tracking  problem.  The 
result  can  be  expressed  in  terms  of  a  sequence  of  linear  programs,  v/here  the  number  of 
variables  is  fixed  throughout.  The  results  are  applicable  to  any  multi-input  multi-output  P 
with  a  minimal  state-space  description  (A,  B,C)  .  Tirne-suboptimal  tracking  is  achieved  for 
the  set  of  admissible  reference  signals  associated  with  P  .  The  results  are  incorporated  in  a 
closed-loop  design  procedure.  An  example  illustrates  the  fecisibility  of  the  approach. 


3.3  Finite-Time  Tracking  of  Reference  Signals 

Let  the  plant,  P  ,  be  strictly  proper  and  have  a  minimal  state-space  description  (A,  jB,C)  , 
with  Hi  inputs,  outputs  and  n,  states.  The  state-space  description  can  be  in  continuous¬ 
time  or  discrete-time.  For  the  sake  of  illustration,  the  results  will  be  exclusively  stated  for  the 
continuous-time  strictly  proper  multi-input  multi-output  plant  P  .  Since  the  approach  relies 
on  a  rational  transfer  function  description,  with  appropriate  modifications,  discrete-time 
setting  can  also  be  handled. 


3.3.1  Definition:  (Ut) 

For  a  given  T  €  (0,  oo)  ,  Ut  denotes  the  set  of  all  bounded^  inputs  of  duration  T  ,  where 

Ut  ;=  {  «  :  1R+  — ►  IR"'  |  u{t)  =  0  for  t>T  ,  Halloo  <  oo  }  ' 

Typically,  the  actuators  have  saturation  limits.  After  appropriate  normalization  of  the 
input  u  to  the  plant,  we  cissume  that  the  saturation  constraints  are  expressed  as  ||u|loo  1  . 


3.3.‘ii  Definition:  (2" -track) 

For  a  given  2’  €  (0,oo)  ,  y  is  said  to  T'-track  r  ill'  y{t)  =  r{t)  for  alH  >  2'  .  □ 

For  a  given  reference  signal  r  ,  the  minimum-time  tracking  problem  is  to  determ iiu  the. 
minimum  time  2’  and  an  input  u  C  Ut  ,  such  that  jju||oo  ^  1  and  [P  +  u)  2'-track;;  r  .  Since 
the  inputs  are  restricted  lo  Ur  ,  the  reference  should  T-track  a  zero-input  riisponse  of  P  . 

-with  .ospect  to  the  norm  II  •  llco  .  For  u  ;  111+ — ►  IIl"‘  ,  |lu||oo  :=  H"P(6nr+ h'i(<)l 
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3.3,3  Fact: 


For  a  given  plant  P  with  a  minimal  description  {AyD^C)  ,  and  a  reference  r  ,  there  exists 
an  input  u  ^  Ut  such  that  {P  *  u)  F-tracks  r  if  and  only  if  r  G  TZx  ,  where 

7^T  :=  {  r  :  III+  IR”“  !  r{i)  =  Cc^^xq  ,  xo  G  HI”'  ,  i  >  T  }  .  □  (1) 

Note  that  (1)  is  a  complete  description  of  reference  trajectories  that  the  plant  output  can 
T-track  with  inputs  in  Ut  •  From  now  on,  we  will  refer  to  the.set  Itx  as  the  set  of  admissible 
reference  trajectories.  Using  the  description  in  (1)  ,  any  r  G  Itr  has  a  unique  decomposition 
as  stated  in  the  following  fact. 

3.3.4  Fact: 

For  a  given  plant  P  with  a  minimal  description  {AyByC)  ,  let  r  G  'R.t  .  Then  there  exists  a 
unique  ry  G  IAt  and  a  unique  Xr  G  HI”'  such  that 

r{t)  —  rxit)  +  .  □ 

3.3.5  Finite-Time  Tracking  Problem  Vt  : 

For  a  given  r  G  ,  determine  a  u  €  Wj  such  that  {P  *  u)  T-tracks  r  .  □ 

Clearly,  the  problem  Vt  heis  always  a  solution,  since  there  is  no  constraint  on  the  actuator 
signal.  The  standard  analytical  solution  method  for  Vt  is  outlined  in  order  to  motivate  the 
results. 


3.3.6  Finite-time  tracking  with  unconstrained  actuation: 

Let  (yl.  By  C)  be  minimal;  without  loss  of  generality,  set  initial  time  instant  to  iiero.  We  now 
briefly  state  the  standard  solution  method. 

1.  Fix  T  G  (0,oo)  . 

2.  Fix  r  G  V-t  • 

3.  Determine  the  unique  x{T)  described  by  the  constraint 

r{i)  =  Ce^^‘-'^^x{T)  ,  t  >  'P  ; 


r  r{T)  ] 

CA 

x{T)  = 

r(U(7’) 

4.  Solve  for  Wc{T)  where  Wc{t)  satisfies 

Wa  =  TVkc  +  WcA^'  +  BB'^'  ,  Wc{0)  =  0 
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5.  Compute  the  finite-duration  minimum  energy  control  that  steers  xq  to  x{T)  ,  namely 


u{t) 


B'rc^^('^-^)WsHT)[x{T)~e^^Xo]  f  G  [0,r]  ^ 

0  t  >  T  ' 


(2) 


Recall  that  the  minimum  energy  solution  in  (2)  is  a  solution  of  the  linear  equation 

XT  =  Ct{u)  ,  (3) 

where 

XT  :=  x{T)  — 

Ct{u)  :=  f  Bu{T)dT  . 

Jo 

If  one  obtains  a  parametrization  of  the  null-space  of  Ct  ,  together  with  the  minimum 
energy  solution  in  (2)  ,  one  obtains  a  parametrization  of  all  solutions  to  (3)  .  Note  that,  the 
resulting  parametrization  is  a  parametrization  of  all  inputs  that  achieve  T-tracking  of  the 
reference  r  .  One  could  then  search  over  the  subspace  of  solutions  for  an  input  that  satisfied 
the  maximum  bounds. 

A  straightforward  approach  would  be  to  introduce  sampling  at  a  fixed  rate  and  zero- 
order  hold.  Instead  of  keeping  the  input  value  at  each  sampling  instant  as  an  independent 
parameter  (v/hich  would  increase  the  complexity  of  the  optimization  problem  considerably), 
hold  the  input  signal  constant  over  a  fixed  amount  of  sampling  instants  so  that  the  number 
of  free  parameters  is  reasonable.  The  equation  in  (3)  would  then  be  a  linear  equation  in  finite 
number  of  parameters;  together  with  the  parameter  bounds  imposed  by  the  saturation  limits, 
one  ends  up  with  a  standard  lineai’  programming  problem.  There  is  one  major  drawback  of 
this  approach:  setting  up  the  approximate  linear  programming  problem  requires  considerable 
amount  of  computation  (discretization,  time-domain  convolutions,  state-transition  matrix 
evaluations,  etc.)  . 

The  proposed  solution  method,  sets  up  another  linear  programming  problem  using  trans¬ 
fer  function  description  and  partial  fraction  expansions. 

Before  we  start  describing  another  way  of  parametrizing  inputs  in  U-j-  fb^t  achieve  7  - 
tracking  of  r  ,  we  point  out  the  fact  that  the  standard  minimum  energy  solution  to  Vr  can 
be  used  to  determine  a  lower  bound  on  the  7”s  the  search  is  based  on. 

For  a  signal  u  Ut  ^  Ih^lloo  <  1  implies  that  the  energy'^  ||u||2  <  7'n,  .  We  know  that 
the  minimum  energy  needed  to  steer  Xq  to  x[T)  is  given  by 


II 


v/here  (  =  x{T)  —  e'^'^Xo  .  Clearly,  if  ||fi||2  is  greater  than  7'n,-  ,  there  cannot  be  a  lionnded 
signal  in  Ut  can  do  the  job.  Later,  during  the  detailed  de.scrii>tion  of  an  cr^ainjde,  we  will 
use  this  energy  bound  condition  in  order  to  show  the  effect  of  the  proposed  ajiproxiinations 
to  the  mininnini-tiine  reachability  problem. 

^For  u  :  IR4.  -►  IR"’  ,  \\n\\?^  :=  u'' (T)u{T)(lr  . 
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3.4  Transfer  Function  Approach 

Although  time-domain  solutions  and  necessary  conditions  of  optimality  formulate  the  com¬ 
plete  solution  to  the  minimum-time  tracking  problem,  it  requires  considerable  computational 
effort  to  solve  the  resulting  nonlinear  problem.  What  we  propose  is  to  approximate  the 
problem  as  a  sequence  of  finite-time  tracking  problems  in  terms  of  linear  programming.  The 
finite-time  tracking  problems  are  formulated  in  terms  of  partial  fraction  expansions;  hence 
they  require  no  time-domain  simulations.  We  now  describe  the  procedure. 

What  we  mean  by  the  poles  of  a  function  of  time  are  the  poles  of  the  Laplace- transform 
of  the  function.  Note  that  a  signal  in  Ut  is  entire;  hence,  it  does  not  have  any  poles. 

Recall  that  for  a  given  minimal  (A,  B,  C)  ,  the  set  of  all  reference  signals  that  can  be 
tracked  in  finite  time  is  given  by  (1)  .  From  Fact  3.3.4  ,  we  conclude  that  the  reference  signal 
can  have  poles  at  the  plant  poles  and  nowhere  else. 

Let  r  €  'R-t  a^nd  the  plant  be  at  an  initial  state  Xo  at  <  =  0  .  Let  rr  and  Xr  describe  tlie 
unique  decomposition  of  r  as  in  Fact  3.3.4  .  The  goal  is  to  find  a  u  €  Ut  such  that 

Rt{s)^C{sI-  AY^Xr  =  C{sJ-AYYxo  +  BU[s))  . 

Since  signals  in  Ut  do  not  have  any  poles,  the  set  of  all  signals  that  achieve  ^’-tracking  is 
described  by  partial  fraction  expansion  matching  conditions.  From  now  on,  without  loss  of 
generality,  we  will  assume  that  xq  (the  initial  condition)  is  zero.  If  xq  is  not  zero,  one  can 
redefine  Xr  as  {xr  —  a'o)  • 


3.4.1  Proposition: 

Let  the  plant  P  have  the  expansion 


^00  =  EE 


where  Kij  (E  .  Let  r  t  R-t  ■  Consider  the  unique  decomposition 


^  J  T  T  J  ej.  ;>  , 

where  tt  E  Ut  and  Vcxp  —  Ce'^^Xr  (see  Fact  3.3.4)  .  Under  these  assumptions, 
L  I'cxj,  E  TZt  ;  moreover, 

RfX])  ( 'S  ) 

where  /?,j  G  Ill"°  . 

2.  The  set  of  all  signals  u  E  Ut  such  that  (B  *  u)  ^'-tracks  r  is  given  by 


k  m, 

=  EE 


A'o 


r 

P  ^ 

\  u  E  Ur 

1 

-.s,)-B(.s)/y(.s)] 

i  \  .  Y",  j  .□ 


('1) 
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3.4.2  A  Convex  Approximation  to  the  Minimum-Time  Tracking  Problem 

In  order  to  use  the  results  mentioned  above  in  a  finite-dimensional  optimization  problem, 
choose  a  basis  of  lA-r  (preferably,  one  for  which  the  Laplace  transforms  can  be  easily  calcu¬ 
lated)  .  Truncate  this  bcisis  down  to  a  finite  collection  for  a  specified  N  .  Consider 

the  subclass  of  signals  in  Ur  whose  Laplace  transforms  are  of  the  form 

VM  =  . 

t=l 


Clearly,  all  of  the  matching  constraints  in  (4)  translate  into  a  linear  equation  in  terms  of 
p  €  IR^  ,  say 

V{T)p  -  7  •  (5) 


Note  that  7  fc  HI”  ,  where 


k 

11  :=  Ho  •  ^  riii  . 


The  entries  of  7  are  solely  determined  by  the  entries  of  Kij  corresponding  to  Vexp  (see  Propo¬ 
sition  3.4.1)  .  We  will  refer  to  the  space  7  is  in  as  the  residue-space  (an  abuse  of  notation; 
after  all,  not  all  entries  of  7  correspond  to  the  residues  in  the  partial  fraction  expansion)  . 
For  the  specified  expansion  Up{s)  and  specified  T  ,  (4)  determines  a  unique  l^T)  .  Solv¬ 
ing  (5)  for  different  vectors  7  in  the  residue-space,  one  obtains  solutions  to  Vr  for  different 
admissible  reference  signals 

As  we  have  seen  before,  there  is  no  loss  of  generality  when  the  reference  trajectory  r 
is  expressed  in  terms  of  r^p  only.  The  finite  duration  portion  has  no  contribution  to  the 
matching  constraints  in  (4)  .  We  now  point  out  an  observation  that  may  further  simplify 
setting  up  the  problem.  Typically,  the  reference  r^xp  does  not  have  nonzero  coefficients  at 
each  and  every  single  pole  of  P  ,  i.c.,  not  all  entries  of  7  are  nonzero.  The  desired  reference 
is  usually  T-tracked  by  exciting  a  subset  of  the  j)lant  poles.  Note  that  if  none  of  the  plant 
poles  are  excited,  the  })lant  output  will  also  be  in  Ur  ■ 

Suppose  that  one  has  a  model  of  P  of  the  form 


P{s) 


I<n  ^  A. 


where  the  desired  performance  is  to  achieve  rigid  body  j)crformancc  with  minimal  residual 
vibration  due  to  the  rest  of  the  flexible  modes.  Note  that  the  results  mentioned  so  far  are 
quite  general,  not  specifically  derived  with  this  simplification  in  mind.  Carrying  on  with  the 
observation,  since  the  flexible  modes  are  distinct,  zero  residual  vibration  at  .s,  is  achieved 
if  f/(.s,)  =  0  ,  provided  that  the  plant  is  initially  at  rest.  Since  the  rank  of  Ki  is  nonzero 
(otherwis<;  the  modt;  would  ihjI  show  up  in  the  model).,  (J{si)  has  to  be  in  the  null-siiace  of 
h'i  .  Definitely,  chosing  U{si)  =  0  is  a  sufficient  condition;  however,  the  rc^stilts  considerably 
simplify  since  the  performance  boils  down  to  stej)  and  ramp  constraints  on  f/(.s)  .  'Jlu' 
null-space  description  for  tlie  ri^sidual  viliration  port'on  can  b<!  re[/laced  liy  setting  th(!  input, 
to  zero  at  those  jioles.  Clearly,  for  single  input  systems,  this  sullic.i<;nt  condition  is  also 
necessary.  Note  that,  wlien  one  seeks  an  injiut  that  steers  any  state  to  a  terminal  state, 
//(.s,)  will  not  neex'ssarily  l)e  zero,  in  order  to  supiuess  the  corresjunidiiig  mode. 


Provided  that  (5)  has  a  solution,  one  brings  another  way  of  solving  the  finite-time  tracking 
problem  Vt  •  Introducing  the  actuation  saturation  constraints,  we  end  up  with  a  convex 
feasibility  problem: 

Find  p  G  such  that  ^{T)p  =  7  and  H/J^/iHoo  ^  1  , 

where  h{t)  :=  ...  explained  later  on,  by  suitably  choosing 

the  finite  collection  of  functions  in  Ht  ,  the  convex  problem  becomes  a  linear  programming 
problem. 

Instead  of  solving  a  sequence  of  convex  feasibility  problems  and  seek  answers  of  the  form 
yes  or  no,  one  can  form  a  slight  variation  to  extract  the  most  from  each  optimization  run. 
For  a  fixed  T  and  N  ,  ■y  denotes  the  desired  direction  and  magnitude;  instead  of  solving  for 
7  solve  for  a  scaled  version; 

max  A  .  (6) 

p  G  A  €  111 
r(r)p  =  A7 
IIp^MIoo  <  1 

Solving  the  convex  minimization  problem  (6)  ,  one  obtains  the  maximum  performance 
along  the  direction  7  for  a  specified  T  and  N  .  Clearly,  one  can  then  solve  a  sequence  of 
convex  minimization  problems  by  varying  T  (and  N)  to  sweep  a  maximum  performance 
curve;  hence,  obtaining  an  approximation  to  the  minimum-time  problem.  In  the  rest  of 
this  report,  we  will  fix  A'  and  vary  T  .  The  implications  of  this  approach  will  be  discussed 
in  detail  for  the  specific  convex  setting:  linear  program.  Such  a  choice  is  made  only  for 
making  sure  that  the  complexity  of  the  sequence  of  convex  optimization  problems  stays  the 
same  throughout  the  whole  sweep.  If  one  is  not  interested  in  further  simplifying  this  convex 
program  (6)  to  a  linear  program,  one  could  apply  all  of  the  topics  discussed  under  the  linear 
program  sef  up  to  this  general  convex  setting. 

3.4.3  A  Particular  Choice  for  Up{s) 

A  time-optimal  input  signal  is  necessarily  bang-bang;  i.e.,  it  is  piecewise  constant  (with  values 
±1  ,  for  a  detailed  treatment  of  the  solution  technique  using  calculus  of  variations,  sec  e.g., 
[Athl])  .  While  there  is  no  upper-bound  on  the  finite  number  of  switchings  in  general,  for  a 
fixed  number  of  switchings,  say  N  ,  one  can  choose  a  sequence  of  N  jjulscs  (with  alternating 
amplitudes)  with  varying  widths  as  a  family  of  functions  in  Ur  •  One  can  then  solve  for  the 
constraints  in  (4)  to  obtain  a  family  of  nonlinear  (non-convex)  algebraic  equations  [.Schl]  . 
Similar  algebraic  equations  are  obtained  by  applying  the  Pontryagin’s  Maximum  Principle 
to  specific  Ccises  of  using  state-space  computations  [I3cnl,  Sinl]  .  Since  the  signals  (by 
construction)  ,  do  not  violate  the  specified  actuation  bounds,  solving  for  the  minimum  width 
pulse  sequence  brings  an  apiiroxiiriation  to  the  solution  of  the  mininmm-tiine  problem.  In 
order  to  avoid  local  inininia  ])roble.ins,  we  choose  a  linear  combination  (;f  functions  in  Uj-  . 

For  a  fixed  7’  ,  coiisider  the  weiglited  sum  of  a  sequence  of  N  ])ulse.s  with  uniform  widths 

jN)  ;  i.e.,  choo.se  f/p(.‘<)  as 
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(7) 


p’TIN  _  1  N 


C  denotes  the  Laplace-transform  and  1  denotes  the  unit-step  function.^  Since  the  input 
signal  is  piecewise  constant,  the  actuation  bounds  can  be  expressed  as 


where  p  G  is  to  be  determined.®  Recall  that,  the  general  convex  approximation  was 
in  fact  a  linear  equation  with  convex  constraint  on  the  parameters.  Now  that  the  input 
constraint  can  be  equivalently  represented  as  a  parameter  bound,  we  end  up  with  a  linear 
program  setup. 

Before  proceeding  with  the  sequence  of  linear  programs,  we  comment  on  the  choice  of 
N  . 

The  matching  conditions  in  (4)  yield  n  equations  in  N  unknowns.  In  order  not  to  have 
a  trivial  null-space,  N  should  be  greater  than  n  .  Note  that  the  null-space  solution  will  be 
used  to  find  a  solution  in  the  hypercube  —  1  <  p  <  1  .  If  the  time-optimal  switchings  were 
at  integer  multiples  of  T/N  ,  the  resulting  linear  program  would  return  a  solution  at  one 
of  the  corners  of  the  hypcrcube.  Hence  the  specific  approximation  as  a  linear  program  will 
consistently  come  up  with  solutions  trying  to  reach  a  corner,  but  not  necessarily  get  there. 
Clearly,  the  finer  the  pulse  width,  the  higher  the  number  of  pulses  that  attain  the  maximum 
bounds.  In  the  detailed  treatment  of  an  example,  we  will  comment  on  the  improvements  of 
the  solutions  with  respect  to  N  . 

There  is  another  way  of  interpreting  this  specific  choice  of  input.  Since  the  pulses  are 
uniform,  one  could  view  the  coefficient  matching  in  (4)  as  matching  the  output  of  the  discrete¬ 
time  plant  (namely,  P  sampled  at  NjT  Hz)  .  This  dictates  that  the  choice  of  N  should  be 
so  that 

lmag(s,)7’  <2nN  ,  i  =  l,...,A:  ,  (8) 

(preferably,  <g;  rather  than  <  )  ,  where  5,  denotes  the  ith  pole  of  P  . 

Typically,  one  has  energy  constraints  on  the  actuators, 

llnjl^  ^  Pmax  1 

where  E^ax  denotes  an  upper  bound  on  the  actuation  energy.  Note  that  this  brings  an  upi)cr 
bound  on  T  since  the  inputs  satisfy  llr^Hoo  1  • 

II  ^'^112  —  ^^,'7  ^  Emax 

Denote  the  upper  bound  on  7'  <us  Pmax  ■ 

Hence,  once  the  pole.s  of  P  are  determined,  one  can  choose  such  an  N  and  fix  the  order 
of  the  linear  program  throiighout  the  solution. 

*•  ^ '  •<')-=(  ?  ;  Itl  ■ 

*^r’or  ])  CF,  IR^  ,  —  1  <  p  <  1  (ioiiotrs  —1  <  pk  <  1  ,  for  1  <  ^’  <  yv  . 
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3.4.4  An  Approximation  by  a  Sequence  of  Linear  Programs 

For  a  given  plant  P  and  N  ,  the  maximum  performance  function 

^  :  (0,oo)  X  ni”  1R+ 

is  defined  in  terms  of  the  following  linear  program. 

:=  max  A  ,  (9) 

p  e  IR^,  A  €  IR 
r(T)p=A7 
-1  <P  <  1 

where  r(r)  is  obtained  by  the  matching  conditions  (4)  .  Any  p  €  IR^  for  which  the  linear 
program  in  (9)  returns  the  value  ^(T,7)  ,  is  used  to  define  the  relation  $  , 

^  :  (0,oo)  X  IR"  IR^  , 

where 

:=  P  . 

For  a  specified  time-instant  T  ,  ^(T,  7)  determines  the  maximum-performance  one  can 
achieve  along  the  specified  direction  7  .  For  a  fixed  direction  7  ,  by  sweeping  over  T  ,  one 
obtains  the  7-maximum-performance  curve 

(r,'j-(r,7))  ,  reiR+  . 

Instead  of  generating  the  7-maximum-performance  curve  for  T  €  IR^  ,  introduce  a  time- 
resolution  of  AT  and  discretize  the  curve;  i.e.,  for  a  pre-determined  kmax  1  evaluate  the  curve 
at  integer  multiples  of  AT  : 

(T,^(T,7))  ,  T  =  A;AT  ,  0  <  ^' <  Aw  • 

The  sub-optimal  (due  to  the  approximations)  time-instant  T  for  which  the  residue  7  is 
achieved  is  in  the  interval 

'r€[T,,T,-f  A2’]  , 

where 


'J'(T<,7)  <  1  , 

^(T,-hA7’,7)  >  1  . 


3.4.5  T-Polytopes  and  ^’-isochrones 

Consider  the  maximum-performance  function  defined  in  (9)  .  The  7’-polytope  <Sr, 

.Sr  {  A7  1  7  e  nr  ,  II7II2  =  1  ,  A  €  [0,  '1^(7’,  7)]  }  .  (10) 

is  tlie  set  of  all  points  in  tlie  residue-space  that  can  be  reached  in  T  seconds.  The  boundary 
of  this  set^  denoted  by  dSj'  >  i^  referred  to  a.s  the  T-isochrone. 

Let  Sx  denote  the  set  of  all  residues  that  can  be  reached  in  T  seconds  when  inputs  are 
restricted  to  bounded  (by  one)  signals  in  Ur  ;  i.e.,  the  7’-optimal-polytope.  In  other  words, 
St  is  “the”  complete  result  if  we  were  not  to  restrict  the  inputs  as  we  did  in  (9)  . 

Wc  have  the  following  simple  yet  crucial  properties: 
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•  Sx  C  St  • 

•  St  and  St  are  convex. 

Hence,  St  is  a  convex  approximation  to  St  . 


3.4.6  A  Simple  Approximation  to  St 

Consider  the  following  T-polytope  St  which  is  a  subset  of  St  in  (10)  . 

St  :=  (  EAi4r(J',ei)e,  I  Pll,  <l|  .  (11) 

f  i=l  J 

where  {ci ,  ...  ,  e„}  is  a  basis  in  HI”  . 


3.4.7  A  Map  from  the  Residue-Space  to  the  Input  Space 

Our  goal  is  to  construct  a  map,  possibly  using  look-up  tables,  such  that  given  a  desired 
reference  signal  and  the  states  of  the  plant,  an  input  signal  is  generated  so  that  the  plant 
output  ^’-tracks  the  reference  signal  as  fast  as  possible. 

The  desired  reference  signal  (which  should  be  in  TZt)  and  the  initial  state  determines 
the  amount  of  change  necessary  in  the  residues  of  the  output  signal.  For  this  reason,  we 
will  focus  on  the  following  subproblem:  For  a  given  7  in  the  re.sidue-.space  determine  an 
input  signal  such  that  the  output  of  the  plant  achieves  the  residues  specified  by  7  aus  fast  as 
possible. 

In  the  rest  of  this  report,  we  will  use  the  standard  orthonormal  basis  {ei ,  . . .  ,  Cn)  in  IR"  . 
In  order  to  cut  down  on  the  storage  space,  we  will  identify  St  with  its  “positive  orthant”: 


.Sr  :=  {  X:A,'k(r,e,)e.-  1 

f  i=l 


1  <  1  ,  A  >  0 


Let  the  signum  function  SGN  :  IR”  HI”  be  defined  as 

c[SGN(',)  •.=  •! 


1  if  ef  7  >  0 
0  if  cfj  =  0 
[  —1  if  eJ'j  <  0 


Let  the  operator  .*  denote  element- by-element  product  in  IR"  ;  i.e., 

4  (71  72)  :=  (er7i)(t;j72)  ■ 

Suppose  that  7  t  OSt  ■  Let  7  be  defined  by 


(12) 


7  SGN{j)  .  *  7  . 

Clearly,  7  £  dS-r  if  and  only  if  7  €  dSx  ■  Since  7  can  be  exi)ressed  as  a  convex  combination 

7  =:  ,  A>0,  ||A11,  =  1  , 
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we  conclude  that  the  input  that  achieves  7  residue  in  T  seconds  is  represented  by 

p  =  ■£hHT,ei)  . 

i=l 

The  input  that  achieves  7  residue  in  T  seconds  is  represented  by 

4=1 


Note  that 

- 1  <P<  1  . 

Since  the  input  is  assumed  to  be  piecewise-constant  over  TJN  second  durations,  suppose 
that  (with  respect  to  the  underlying  sampling  rate)  there  are  M  points  per  pulse.  For  the 
sake  of  illustration,  assume  that  n,'  =  1  ;  let  u  €  IR^  be  the  vector  will  all  entries  equal  to 
one.  The  input  sequence  that  is  compactly  represented  by  p  6  IR^  is  given  by  the  following 
sequence  (from  0  to  T)  u  €  IR^^  : 


u  =  p  Q  u  , 

where  the  operator  ®  denotes  the  tensor  product.  Hence  the  underlying  idea,  is  to  express 
a  given  residue  vector  7  as  a  convex  combination;  we  choose  to  do  this  on  one  of  the  T- 
isochrones. 

We  now  outline  the  procedure: 

1.  Fix  N  and  Tmax  ,  following  the  points  mentioned  above. 

2.  Determine  a  time  resolution  AT  .  Suppose  that  Nt  :=  rouiid(7'niar/A7’)  points  sweep 
the  desired  range  over  which  the  actuation  is  considered.  Note  that,  with  respect  to 
this  quantization,  for  any  T  G  (Tq  —  AT,  7o]  ,  7’-tracking  is  indistinguishable  from 
7o-tracking. 

3.  Solve  (n  •  Nt)  linear  programs;  i.e.,  for  the  Nt  time  points  solve  for  and  store 

^(T,e,)  ,  4»(T,e,)  ,  i  =  . 

Note  that  if  steady-state  to  steady-state  7'-tracking  is  to  be  desired,  one  need  not  span 
all  n  directions,  as  it  will  become  apparent  later  on  in  the  detailed  example. 

4.  So  far,  the  storage  sp<ace  needed  is  (n  •  Nt  ’  {N  +  1))  .  This  data  represents  the  Nt 
isochrones,  i.e.,  the  boundaries  OSt  ■  This  completes  all  preprocessing  that  is  necessary 
to  construct  the  desired  map.  Note  that  along  each  direction  e,  ,  we  have  reached 
out  as  much  as  possible.  Hence,  for  residues  which  arc  close  to  the  directions,  the 
convex  approximation  will  yield  the  best  possible  result  subject  to  the  linear  program 
approximation  to  the  time-optimal  problem. 
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5.  The  domain  of  the  desired  map  is 


<St 


That  is,  for  any  7  €  Srmax  >  will  generate  an  input  that  achieves  that  residue  as 
fast  as  possible.  Recalling  the  discussion  above,  we  will  identify  7  with  7  €  Srmax  •>  by 
keeping  track  of  5GA^(7)  ,  to  cut  down  on  storage  space,  - 

-  !  A' 

6.  For  a  given  7  determine  the  smallest  T  such  that  7  C  St  Wg  propose  the  following 
procedure:  Let  rj  €  IR^*'  be  defined  as  !■,  d  '  V 


Nt 


Let  the  iih.  entry  of  i)  be  the  first  entry  which  is  less  than  or  equal  to  one.  Since  7  was 
in  to  start  with,  such  an  t.  exists.  Hence,  we  conclude  that,  among  the  family  of 

T-isochrones,  7  can  be  achieved  no  faster  than  r<  seconds. 


7.  Once  the  index  t  and  the  multiplier  ejri  is  determined,  assign  p  e  IR 


N 


(13) 


Note  that  p  in  (13)  satisfies  —1  <  p  <  1  ,  moreover,  the  input  that  is  compactly 
represented  by  p  achieves  the  specified  7  residue  in  Ti  seconds,  the  fastest  among  the 
family  of  7’-isochrones. 

8.  Extract  the  input  sequence  from  p  .  Provided  that  there  arc  n,-  inputs,  we  have 

N  -.Nrii  , 


A 

for  some  integer  N  .  Partition  p  as 

P  “•  [Pl  •  •  •  Pn,]  1  Pk  G  IR  1  ^  ],...,  Tij 

As  mentioned  before,  (with  respect  to  the  sampling  rate  specified)  suppose  that  there 
are  M  points  per  pulse  width  (of  TefN  seconds).  Let  u  <E  IR^  have  entries  equal  to 
one.  The  input  waveform  over  Ttj N  is  represented  by 

U ^  —  p^  ®  XI  j  Ic  —  l,...,^,  , 


where  Uk  €  IR^^^  . 


3.5  Example 

Recall  that  none  of  the  results  stated  so  far,  put  any  constraints  on  (A,  B,  C)  .  In  order 
to  test  out  the  proposed  scheme,  we  focused  on  a  specific  siso  i)lant,  consisting  of  a  double 
integrator  and  three  damped  flexible  modes. 
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[q:i  Ci2  <13]  =  [5  10  20] 

[wi  W2  =  27r[0.5  1  1.5]  rad/s 
[C1C2C3]  =  [0.10.010.001]  . 

The  Bode-plot  of  P  is  shown  in  Figure  2  ;  note  that  the  specified  time  interval  corresponds 
to  excitations  with  frequency  content  ranging  over  the  band  [0.1, 1]  Hz  .  The  choice  of  the 
constants  had  no  specific  purpose  other  than  making  sure  that  the  flexible  modes  significantly 
contribute  to  the  position  output.  In  order  to  show  the  efficiency  of  the  proposed  scheme, 
we  will  focus  on  actuation  signals  in  Ut  ,  where  T  €  [1, 10]s  .  Standard  bang-bang  actuation 
relying  on  the  rigid-body  approximation  of  P  for  rest-to-rest  slewing  over  the  interval  [1, 10] 
s  resulted  in  unsatisfactory  outputs. 


3.5.2  Admissible  Reference  Signals  and  Matching  Conditions 

For  the  plant  model  in  (14)  ,  admissible  Vexp's  are  linear  combination  of  steps,  ramps  and 
lightly-damped  sinusoids.  The  vast  majority  of  tracking  problems  restricts  the  references  to 
to  a  linear  combination  of  steps  and  ramps: 

{  r  I  r{i)  =  To  4-  ri/  ,  To  €  IR  ,  ri  e  IR  ,  i  >  0  } 

In  terms  of  the  previous  notation,  we  have: 


m2  =  m3  = 


m  =  n<,  =  1  , 

Hs  =  n  =  8  , 

7max  =  10  , 

k  =  7  , 

mj  =  2  , 

•  •  •  =  rnr  -  1  , 


namely,  one  input,  one  output,  eight  states,  eight  parameters,  a  maximum  of  ten  seconds  of 
actuation,  seven  poles,  one  of  multiplicitj'  two  and  the  rest  of  multiplicity  one. 

Following  (8)  ,  the  constraint 


liiiag(s,)2'„,aj;  <  2nN  ,  i  —  I,..  .,k 


yields 


N  >  max{  0  ,  4.9749  ,  9.9995  ,  15  )  =  15 


20 


<PGw)  [deg.]  20log(abs(P(jw)))  [dB] 


60 


Figure  2:  Bode-plot  of  P  in  (14) 
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We  choose 

N  =  20  . 

All  simulations  were  performed  by  sampling  the  plant  P  at  100  Hz  .  In  order  to  have 
one  sample  instant  added  on  for  each  time  step  AT  ,  we  choose 

AT  =  0.2  , 

=  46  . 

In  order  to  set  up  the  linear  programming  problem,  we  need  to  construct  r(T)  from  the 
matching  conditions  in  (4)  .  For  this  example,  we  have 

5i  =  0  , 

S2  =  h  =  -0.3142 +i3.1258  , 

54  =  ss  =  -0.0628  +j6.2829  , 
se  =  S7  =  -0.0094  +j9.4248  . 


Expand  P  at  its  poles  (three  multiplicity  one  complex-conjugate  pairs  and  two  at  zero)  , 

p/  ^  4.  ^  4.  ^'2  ,  ^’2  kj  k4  ke  kQ 

'  s  (s  —  62)  {s  —  •S2)  {s  —  ^4)  (s  —  S4)  (s  —  se)  (s  -  se)  ’ 

(15) 


where 


ko 

=  0 

h 

=  1 

^2 

=  jO.7998 

k4 

=  iO.7958 

ke 

=  j  1.0610 

In  general,  admissible  reference  trajectories  r  will  have  the  expansion 


0/  \  ,  ^2  .  »'2  ,  C4  ,  ?'4  ,  re  ,  rc 

R[s)  — :  ^  ~i  \  "f"  7  ^  +  7  \  +  7  +  7 - 7  +  7 - ■  (16) 

s  {^-^2)  {^-^2)  {S-S4)  [s  -  S4)  {s  -  se)  (s-5o) 


Let  the  input  be  chosen  as  in  (7)  .  Using  the  fact  that  signals  in  Ut  do  not  have  any  poles, 
apply  the  matching  conditions  in  (4  to 


R{s)  =  P{s)U,{s)  , 


where  P  and  R  are  as  in  (15)  and  (16)  ,  respectively.  Equating  the  coefficients  of  the  partial 
fraction  expansions,  we  obtain 


kiUj,{0)  =  ri 

(17) 

^’oLV(O)  +  kibf\0)  =  7-0 

(18) 

k2Up{s2)  =  r2 

(19) 

^4f^p(^4)  ^4 

(20) 

keUpise)  =  7’f>  . 

(21) 
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Note  that  equations  (17  -  21)  describe  =  8  linear  equations  with  real  coelTicients  in 
=  20  unknowns,  namely 

r{T)p  =  7  [ro  ri  lleal(r2)  lmag(r2)  Real(r4)  Imag(7-.i)  Real(y6)  Imiag(r6)]^  . 

For  ease  of  notation  let  ones(l,7V)  denote  the  row  vector  whose  entries  are  all  equal  to  1  . 
Let  [1  ;  A^j  denote  the  row  vector  whose  ith  entry  is  equal  to  i  .  For  s  G  01  , 

where 

,4v0  :=^  - 

For  r  =  1  +  (m  -  1)A7’  ,  r(^')  €  is  given  by  : 

(T/N)M0.5-ones(l,N)  -  [1  :  A^])  1 
(77 Af)  -01108(1,  A) 

Rcal(^’2l/(S2/i')^H-5'2,  T)) 

\nydg{k2v{s2,T)(p{s2,T)) 

Rea.\{kMs4/T)<f>is^/r)) 

\mdg{k4y{s4,T)(f){fi4,  T)) 
llcd\{krMsG,  T)(l>{so,  T)) 

Imag(M%.'-^’)</H-‘>-G/-/’)) 

For  illustration  purposes,  we  will  assume  that  the  flexible  modes  arc  not  excited  and 
the  sequence  of  input  signals  acliicve  slewing  and  tumbling  maneuvers;  i.e.,  we  will  focus  on 
7’-tracking  of  steps  and  ramj's  only.  Such  an  assumption  is  made  so  that  the  steps  taken 
can  be  easily  comprehended;  by  working  on  a  two-dimensional  subspace  of  the  residue  sjiace, 
we  can  illustrate  the  7'-isochrones  and  the  7'-tracking  signal  generation  easily.  However,  as 
emphasized  throughout  the  study,  the  results  arc  g  eneral. 

We  now  solve  TVt  linear  programs  along  7  =  C]  and  another  N-y  linear  programs  along 

7  '-=  ^2  . 

The  ej-niaxiinurn-pcrforinaiice  curve  and  e2-inaximum-i>erforinan(;e  curve  are  shown  in 
Figure  3  . 


3.5.3  Cl -Maximum-Performance  Curve 


In  this  section,  we  will  evaluate,  the  cj -maximum-performance  curve  obtained  by  linear  pro¬ 
grams.  I'br  the  ])lant  model  in  (H)  ,  w<;  do  not  have  the  exact  “time-optimal”  ])(n'fornianee 
curve.  Hence  the  true  errors  introduced  in  the  approximation  cannot  be.  account(;d  for.  llow- 


(!ver,  a  very  useful  ecunjnu  ison  will  be  mad(;  iiistecul,  baser! 


mi  inu;  1  ignl- in;(iy  approAiiiial.ioii 


to  P 


Let 
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Note  that,  P  can  be  considered  as  a  perturbation  of  the  double-integrator  model  P  .  The 
ei-time-optimal-performance  curve  of  P  is  given  by 


upo{T) 


Recall  that  the  optimal  bang-bang  input  waveform  (for  P)  is 


u{t) 


'  -fl  ie[0,T/2) 

<  -1  te[T/2,T] 
0  i>T 


The  curve  upo  is  an  upper  bound  on  the  performance  of  P  .  In  order  to  rate  the  achieved 
performance,  consider 

0  Q7' 

lawyer)  :=  C-\-f  ■  (24) 

If  the  ei-maximum-performance  curve  lies  above  Iowq  we  can  conclude  that  P  at^^xieves  the 
performance  of  P  within  a  10%  relative  error  in  time.  In  <  ther  words,  if  P  can  7’-track  tq  , 
then  P  can  T-track  tq  where  T  €  [2’,  1.12’]  .  The  three  curves:  ei-maximum-performance 
curve  ,  upa  and  Iowq  are  shown  in  Figure  4  . 

Note  that  after  3  s  ,  the  ei -maximum-performance  curve  lies  in  between  xlpq  and  loW{)  . 
In  other  words,  for  T  €  [3, 10]  s  ,  the  20-paramctcr  linear  program  approximation  generates 
inputs  for  which  P  achieves  the  performance  of  the  rigid-body  approximation  P  within  10% 
relative  error  in  time. 

Before  proceeding  with  the  rest  of  the  results,  we  would  like  to  draw  the  attention  to  the 
region  2’  €  [1,3]  s  in  Figure  4  .  This  region  is  shown  again  in  Figure  5  . 

We  sought  an  explanation  for  the  deviation  over  this  region  shown  in  Figure  5  from  the 
rigid-body  i)crformance.  To  be  sjmcific,  we  focu.sed  on  the  rigid-body  performance  at  1  s  , 
which  is  achieved  by  the  approximation  after  2  s  .  Note  that  due  to  the  time-quanti'/.ation, 
current  result  vdll  achieve  ro  --  0.25  exactly  in  2.2  s.  In  order  to  see  the  improvements  in 
the  linear  program  aj^proximations  versus  the  parameter  N  ,  we  derived  the  Ci-maximum- 
performance  at  T  —  2  s.  The  rersuits  are  listed  in  Table  1  .  Even  for  lOO  pulse  approxima.tioii, 
the  performance  improved  by  only  4%  . 


r 

'k(2,ei) 

20 

0.23S4 

30 

0.2451 

40 

0.24 GO 

100 

^  0.2486 

'J'able  1:  ci-inaximum-perfoniiance  at  2  s  . 


As  another  approach,  we  iixed  /V  --  2U  and  decreased  the  timc-quantizalion;  Vq  was 
achieved  in  2.013  s  .  In  other  words,  the  20-pai  ameter  ajjproximation  generates  inputs  so 
that  tlie  plant  P  can  2. Old-track  the  0.25-step.  ’J'his  translates  into  a  102%  relative  error  in 
time. 
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In  order  to  account  for  this  102%  relative  error  in  time  ,  the  intrinsic  limitation  of  the  1  s 
maneuver  was  revealed  using  the  minimum  energy  solution  as  explained  in  Section  3.3.6  . 

Consider  a  state-space  description  of  the  plant  P  in  (14)  ,  where  the  position  is  the  first 
state.  The  goal  is  to  get  from  Xo  =  0  €  IR*  to  xy  =  [(^)^  0  ...  0]'^  ,  which  denotes  the 
rigid-body  performance  in  T  seconds.  Let  Vfc(0,  T(1  +£))  denote  the  controllability  gramian 
at  T(1  -t-  e).  Clearly,  if  there  exists  an  input  u  €  UT(i+e)  such  that  ||ul|oo  <  1  and  u  steers 
Xq  to  XT  ,  necessarily, 

[iyj^(0,T(l -t-e))]ii  <  — (1 -f-e)  ,  (25) 

where  [-Ju  denotes  the  (l,l)-entry  and  e  denotes  the  relative  error  needed  to  satisfy  the 
necessary  condition.  At  7’  =  1  s  ,  we  obtain  the  results  in  Table  2  . 


e 

[H/^:‘(0,(l-f£))]u/16/(l  +  £) 

0 

1.2278  +  10^ 

50% 

78.8319 

75% 

3.7966 

80% 

2.1550 

85% 

1.2437 

90% 

0.7323 

95% 

0.4421 

lable  2;  Necessary  condition  (25)  at  7’  =  1  s  . 

From  Table  2  ,  (which  are  evaluations  of  analytically  derived  necessary  conditions),  we 
conclude  that  there  exits  &  u  ^  Ut  that  steers  xq  to  xj  only  if  T  >  T  E  [1.85,1.90]  s  . 
Recall  that  we  arc  also  interested  in  an  input  ||u||oo  <  1  ;  oven  for  T  =  f  there  may  not  be 
such  a  bounded  input.  Regardless,  even  if  there  is  a  bounded  (by  one)  signal  that  achieves 
the  performance  within  1.9  s  ,  this  translates  into  a  90%  relative  error  in  time  which  is  an 
intrinsic  limitation.  Compared  with  the  111%  relative  error  obtained  by  20-parainetcr  linear 
program  approximation,  we  conclude  that  the  approximation  error  is  bounded  by  12%  . 
Intuitively,  0.5  llz  mode  imidies  that  a  2  s  slew  will  be  a  critical  slew.  Any  slew  time  slower 
than  2  s  will  affect  the  frequency  band  above  0.5  IIz  ,  namely  the  modes  of  the  plant. 

3.5.4  e2-Maximum-Performance  Curves 

These  results  bring  an  approximate  solution  to  the  fast  7'-tracking  of  a  moving  target.  Note 
that  in  the  cj  case,  namely  the  rcst-to-rest  slew,  the  goal  was  to  get  from  an  initial  state 
of  the  form  [ro  0  0  ...  0]^  to  [ro  +  <yro  0  0  ...  0]^"  .  In  other  words,  the  target  set  is 
fixed.  In  the  e^-case,  however,  the  goal  is  to  reach  from  [ro  0  0  ...  0]^  to  the  moving  target 
[^’o  +  ^'I'P  0  ...  0]^  as  fast  as  possible. 

Recall  that  the  cj-time-optimal-pcrformancc  of  the  double-integrator  P  (  namely,  7’- 
tiaekiiig  ijy  L)  ~  /■]/,  from  zero  sleaily-stiite)  is  given  by 


upii'I'] 


(1  +  v/2) 


?’i 


(26) 
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Note,  that  the  time-optinial  bang-bang  input  waveform  (for  P)  is 


u{i)  = 


'  +1  fe[o,Tx) 

•  -1  te[TuT] 
Q  i>T 


where 

r  .  ,  (2+v5) 

T,  r,  -  . 

The  curve  upi  is  an  upper  bound  on  the  e2-performance  of  P  .  In  order  to  rate  the 
achieved  performance,  consider 

ioMT)  :=  .  .  ■  (27) 

If  the  e2-maximum-performance  curve  lies  above  lowi  ,  we  can  conclude  that  P  achieves  the 
performance  of  P  within  a  20%  relative  error  in  time.  In  other  words,  if  P  can  T-track  r^t  , 
then  P  can  T-track  rit  where  T  €  [T,  1.27’]  .  The  three  curves:  e2-maxinium-performance 
curve  ,  upi  and  lowi  are  shown  in  Figure  6  . 

Note  that  after  3  s  ,  the  e2-maximum-performancc  curve  lies  in  between  upi  and  lowi  . 
In  other  words,  for  T  €  [3, 10]  s  ,  the  20-para.meter  linear  program  approximation  generates 
inputs  for  which  P  achieves  the  performance  of  the  rigid-body  approximation  P  within  20% 
relative  error  in  time. 

Similar  intrinsic  limitations  could  be  worked  out  for  this  performance  plot  in  Figure  6  . 
Note  that  for  [4.5, 10]  s  ,  the  20-parameter  approximation  has  at  most  10%  relative  error. 


3.5.5  St 

Recall  the  T-polytope  St  in  the  positive  orthant  of  the  residue-space  (see  (12))  .  For  the 
purpose  of  illustration,  consider  the  projection  of  St  onto  the  first  two  coordinates  of  the 
residue-space.  In  the  rest  of  this  section  wc  will  refer  to  St  and  the  7’-polytope  St  (sec  (10)) 
as  tlic  projections: 

St  St  n  {x  IR"  |  Xg  =  X4  •  •  •  =  a:,.  =  0  }  ,  (28) 

St  iSr  n  {x  €  111"  I  X3  =  X4  =  •  •  •  =  x„  =  0  } 

We  now  comment  on  the  approximation  St  of  St  for  the  specific  example. 

Consider  Figure  7  ,  showing  two  sets  of  boundaries  rcj)resenting  the  5-isochrone  and 
1 0-isochrone,  respectively. 

5-  and  lO-Isochrones  : 

Using  the  simple  A^j-linear  program  approximations  along  cj  and  C2  ,  respectively,  one 
obtains  Nt  isochrones  .  For  the  purpose  of  illustration,  we  focus  on  the  5-  and  lO-isochrones 
shown  in  Figure  7  .  Following  (28)  and  (12)  ,  the  poly  tope  Sr^  is  determined  by  the  boundary 
0  A  D  0  .  The  polytojjc  Sm  is  determined  by  the  boundary  0  K  ]!  0  .  In  order 

to  rate  tlie  effectiveness  of  the  apj)roxiniatioii  {St  api,>roxiinatiiig  Sr)  ,  we  solved  two  sets  of 
linear  programs  along  two  more  directions  other  than  cj  and  ;  specifically, 

4>(7',  [110  0...  0]'^') 
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1  2  3 


4 


9  10 


gurc  G:  C2-niaxiiiiuin-perfonnance  curve  (*)  ,  upi  (2G)  and  low\  (27) 
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and 

$(r,  [6100  ...  0]^)  . 

In  other  words,  we  obtained  the  maximum-peformance  curves  along  the  rays 

(0-5-F)  and  (0-?-G)  , 

respectively  (see  Figure  7)  .  Using  the  associated  four-breakpoints,  we  obtain  0  —  ^  —  B  — 
C  —  D  —  0  as  &n  approximation  to  «Ss  and  0  —  —  —  G  —  //— Oeisan  approximation  to 

Sio  (see  Figure  7)  .  Note  the  amount  of  improvement  in  each  case.  For  the  10~second  case, 
comparison  of  the  enclosed  areas  {0  —  E  —  H  —  0  versus  0  —  —  F  —  G  —  //  —  0)  reveals  an 

8.64%  improvement. 

Clearly,  by  introducing  more  directions,  one  can  obtain  successively  better  approxima¬ 
tions  to  St  .  Our  goal  was  to  generate  a  map  from  the  residue-space  into  the  input  space. 
In  the  process,  given  a  point  in  the  residue-space,  we  seek  a  pair  of  succsessive  T-isochrones 
for  which  the  point  is  outside  one  but  inside  the  next.  We  then  use  the  data  associated 
with  the  breakpoints  of  the  isochrones  to  determine  the  required  input.  Clearly,  the  one- 
hyperplane  approximation  St  is  the  simplest.  Introducing  more  hyperplanes  increase  the 
accuracy  of  the  T-polytope  approximation,  however  it  complicates  the  generation  of  inputs. 
A  multi-hyperplane  approximation  may  eventually  require  another  linear  program  solution 
to  determine  the  “cooordinatcs”  of  the  point  in  the  residue-space  in  terms  of  the  “corners” 
of  the  associated  r-polytope. 

3.5.6  A  Fast  T-Tracking  Example 

In  this  section  we  illustrate  Steps  6-8  in  Section  3.4,7  .  Assume  that  the  plant  P  in  (14)  is 
initially  at  rest.  Let  the  reference  trajectory  r  be  as  shown  in  Figure  8  . 

The  breakpoints  and  slopes  in  Figure  8  reveal  that  the  sequence  of  inputs  should  satisfy 
the  following  increments  in  the  residue-space  denoted  by  the  ordered  pairs  (ro,7'i)  : 

at  T  =  0  ,  (-1-2,  0) 

atr  =  5  ,  (-2,-1) 

atT=12  ,  (+7,-fl)  . 

Using  the  one-hyperplane  approximations  to  the  Nt  7’-polytopes,  the  fastest  tracking 
times  for  the  sequence  of  residue  increments  are  3,2  ,  4.6  and  7  s  ,  respectively.  Figure  9 
shows  the  input  and  the  associated  output  of  P  in  (14)  . 


3.6  Robustness  Considerations 

Consider  a  plant  P  of  the  form 

^'  =  ^  +  (JZT)  .  (29) 

where 

s  —  .So  -|-  ,  0<r<l  ,  0<^^<27r  , 

,  )?  =  (/ -1- A)A';  ,  ||A||,  <6  . 
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8:  Reference  trajectory  to  be  fast  T-tracked 
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In  other  words,  the  plant  model  P  is  “exact”  up  to  P  with  an  uncertainty  description  about 
the  pole  at  .Sq  ■  We  assume  that  the  location  of  this  pole  is  uncertain,  i.e.,  the  multiplicity 
of  the  pole  remains  the  same  in  the  region  of  uncertainty.  As  we  will  see  later  on,  the 
multiplicity  of  the  pole  (as  long  as  it  remains  the  same)  and/or  the  region  of  uncertainty 
about  the  pole  can  be  more  general  (i.e.,  multiplicity  can  be  greater  than  one;  the  region 
can  be  any  closed  bounded  simply  connected  region  about  the  pole)  . 

As  we  have  seen  before,  in  order  for  P  to  T'-track  the  reference  signals  of  interest,  P  is 
assumed  to  have  all  the  necessary  poles.  Our  concern  is  the  residual  vibration  due  to  the 
(possibly  uncertain)  poles  which  do  not  coincide  with  the  poles  of  the  reference  signal.  For 
example,  for  the  plant  model  in  (14)  ,  steps  and  ramps  would  be  the  set  of  reference  signals 
and  P  would  have  at  least  two  poles  at  s  =  0  ;  the  uncertainty  due  to,  say,  one  of  the  flexible 
modes  could  be  incorporated  as  in  (29)  ;  note  that  for  such  a  case,  there  would  be  another 
residue  term  due  to  sb  . 

The  crucial  point  is,  since  the  Laplace  transforms  of  signals  in  Uj  are  entire,  we  can  use 
the  property  of  harmonic  functions  to  bring  an  upper  bound  on  the  residual  vibration  due 
to  the  uncertainties.  Clearly,  for  any  signal  u  €  Ut  with  j'u||oo  <  1  , 

||t/('')(s)||  <  T*-— [1  -  e""^]  ,  ^'  >  0  ,  for  all  s  such  that  Real(s)  >  tr  . 
a 

Such  a  bound  is  too  conservative,  since  typically  we  will  be  dealing  with  a  weighted  sum  of 
a  spocilic  sot  of  input  signals. 

Let  Up{s)  denote  the  Laplacc-transform  of  a  family  of  signals  in  Ur  which  is  parametrized 
by  p  €  lll^  . 

One  approach  would  be  to  introduce  more  constraints  on  p  ,  in  terms  of  derivatives  of 
t/p(s)  .  Using  the  Taylor-series  expansion  about  sq  ,  imposing  the  constraints 

KdPJ\so)  =  Q  ,  j  =  0,...,(iV-l)  , 


for  some  A  >  1  ,  guarantees  that  the  residual  term  is 


k=N 


In  other  words,  for  any  s  ,  the  norm  of  the  residue  of  the  outjjut  at  s  ,  cannot  exceed 


11(7  + A)Ao/?.;vll  • 

Note  that,  since  Up{s)  is  entire,  it  cannot  vanish  for  ail  .s  unless  it  is  identically  zero  for  all 
■s  .  Such  an  a])proach  would  increase  the  number  of  constraints  on  p  ;  however,  the  resulting 
problem  is  still  a  linear  program,  hence  the  previously  proj)osed  niothods  still  aj^ply. 

Another  approach  gives  up  the  linear  program  sotting  and  ends  uji  with  a  convex  mini¬ 
mization  problem  by  minimizing  the  wor.st-casc  residue.  Since  Lp(.s)  is  iianiionic,  for  a  closed 
bounded  siniply-connccted  region  U  about  .Sq  ,  we  have 


sup  ||(/,X.s)||  =  sup  j|f/,,(,s)|| 
•len  rm 
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where  dCl  denotes  the  boundary  of  the  region  fl  9  so  •  Discretize  the  boundary  so  that 

dCl  «  '•=  {-Si  I  t  = 


For  the  specified  direction  7  in  the  residue-space,  obtain  the  maximum-performance  for  the 
nominal  plant 

P  + 

(s-so)  ’ 

as  explained  in  the  previous  sections.  In  other  words,  solve  for 


A  =  'lr(r,7) 

p  =  HT.'t)  . 

The  linear  contraints  on  p  take  into  account 

Aof/p(^o)  —  0 

For  this  A  the  norm  of  the  change  in  the  residue  over  0,  is  bounded  above  by 

K  :=  ||(/-h  A)Aol|rnax|lC/p(s)|l  . 

A 

Provided  that  k  is  not  desirable,  choose  A  <  A  (i.e.,  give  up  from  the  performance;  in 
addition,  one  may  also  increase  A^)  and  solve  the  following  convex  minimization  problem: 

min  rnax  i|t/p(.s)ll  , 
p  €  IR.^ 
r{T)p  -  At 
-1  <  p  <  1 


Using  the  global  minimum  values  (for  different  A’s)  ,  one  could  then  see  how  much  the 
original  performance  A  has  to  be  degraded  in  order  to  have  a  desired  residual  performance. 


3.7  Closing  the  Loop 

In  the  previous  sections,  we  proposed  a  method  of  generating  an  input  signal  u  ^  Ut  , 
that  achieves  fast  T-tracking  of  the  desired  7  subject  to  the  actuator  saturation  constraints. 
By  solving  a  sequence  of  linear  programs,  a  look-up  table  is  generated  over  a  region  in  the 
residue-space.  This  look-up  table  is  incorporated  in  a  “signal  generator”  which  is  used  to 
drive  a  unity-feedback  system.  We  now  describe  the  procedure. 


3.7.1  Signal  Generator 

Consider  the  interconnection  in  Figure  10  showing  the  proposed  “signal  generator”  . 

The  desired  reference  signal  is  represented  in  the  residue-space  by  7^  .  Recall  that  for 
a  given  plant  model,  the  set  of  admissible  reference  signals  that  can  be  7'-tracked  can  be 
characterized  by  a  partial-fraction  expansion  at  the  plant  poles.  The  coefficients  of  this 
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^ure  10:  Signal  Generator  , 

expansion  are  the  entries  of  7r  .  In  order  to  represent  a  series  of  reference  signals,  from  now 
on,  7  in  the  residue  space  will  be  identified  with  a  waveform  7  :  111+  —*■  IR”  .  Note  that  all 
three  versions  (of  7)  in  Figure  10  will  be  piecewise-constant  waveforms. 

Whenever  there  is  a  jump-discontinuity  in  7,  ,  the  “reset”  command  initializes  the  clock 
in  and  tvi  rs  the  “HOLD”  block  to  hold  the  value  of  the  plant’s  state  x  at  the  time 
of  “reset”  .  Once  the  state  x  is  held  at  x/,  ,  the  contribution  cf  x.i,  in  the  residue-space 
is  computed  by  multiplying  x/,  with  the  suitable  Partial-Fraction  Expansion  CoefTicients 
(denoted  by  the  i  iock  “PFE  COEFFS”)  .  The  difference  between  7^  and  determines  the 
command  7  , 

The  block  denoted  by  T  implements  the  proposed  scheme  in  Section  3.4.7  .  For  a  given 
command  7  ,  the  fastest  T-tracking  input  u  is  generated  from  the  condensed  data  in  the 
look-up  tables.  The  “reset”  input  synchronizes  the  clock  in  T  in  order  to  generate  the  input 
waveform  u  from  a  data  point  in  .  The  look-up  tables  in  T  corresponds  to  the  nominal 
plant  model  Pq  .  As  explained  before,  specific  uncertainly  descriptions  about  the  nominal 
plant  model  Po  can  be  incorporated  during  the  pre-computations  to  generate  the  look-up 
tables. 

One  of  the  many  drawbacks  of  an  open-loop  implementation  is  sensitivity  to  disturbances. 
Consider  the  model  in  Figure  II  ;  the  SATo,  block  accounts  for  the  saturation  in  the 
actuators. 

Clearly,  when  the  input  disturbance  is  zero,  the  signal  generator  configuration  can  be 
used  to  determine  the  desired  u  for  the  desired  y  .  For  a  bounded  disturbance  0  ,  the 

nonzero  tracking  error  may  even  be  unbound.cd  when  P  is  unstable. 


3.7.2  Unity-Feedback  System 

Once  the  desired  it  and  y  are  determined,  these  signals  are  used  in  the  unity-feedback  con¬ 
figuration  in  Figure  12  .  We  now  describe  a  conservative  design  }jrocedure  for  determining 
a  compensator  G  .  Some  of  the  steps  are  illustrated  with  a  sainjjle  design  vdicre  P  is  taken 


•'I'br  a  given  «  >  0  ,  SATa(  )  :  III”'  — >  III'''  ,  eJ’.S'd7'„(n) 
Note  that  ,SATcr(it)  =  ttSATi(^u)  . 


—a  ,  tt  <  — tt 

el  u  ,  |e(  a|  <  «  I-  =  1 , . .  . ,  n,  . 

n  ,  elu  >  n 
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Figure  11:  Plant  model 

as  the  model  in  (14)  . 

Typically  the  model  P  in  Figure  11  is  not  the  same  as  Pq  in  the  signal  generator.  Our 
goal  is  to  design  a  C  such  that  certain  disturbance  rejection  and/or  robustness  characteristics 
are  achieved  by  the  closed-loop.  The  tracking  specifications  are  totally  ignored  during  this 
design  procedure  since  the  fast  T-tracking  is  achieved  by  the  (u,y)  pair. 

In  the  rest  of  the  section,  we  assume  that  the  nominal  plant  model  Pq  in  the  signal 
generator  and  the  plant  model  P  in  the  unity-feedback  are  the  same;  moreover,  the  ini¬ 
tial  conditions  are  identical.  The  disturbance  d,„  comes  in  after  the  saturation  block  (see 
Figure  12)  . 


Figure  12:  Closed-loop  implementation 

Note  that  the  signal  generator  in  Figure  10  does  not  have  a  SATc  block  prc-cascadcd 
since  the  output  of  p  is  always  within  the  bounds  (by  construction). 

For  the  closed-loop  system  in  Figure  12  ,  over  the  operation  region  for  which  l|u||oo  <  o:  , 
the  closed-loop  map  from  (Ti,y,</,„)  to  u  is  given  by 

u  =  {l  -VCPy^ii  -f-  C{l-VPC)-^y  -  C7^(/-f  CP)-'d,„  . 

Provided  that  y  —  Pu  ,  we  obtain 

u  =  u  -  CP{I  -h  CPy^di^  . 

Clearly,  when  din  =  0  ,  we  have  u  =  u  hence  the  nominal  closed-loop  performance  achieves 
7’-tracking.  Note  that  this  is  true  for  any  stabilizing  C  with  ||u||oo  <  «  •  This  brings  up 
a  trade-off  issue;  smaller  the  contribution  of  din  u  meant)  fa&tei  7'-tiuL.l\iug  and  giealer 
sensitivity  to  din  fbe  plant  output  y  . 

Let  the  input  disturbances  be  such  that 

lldmiloo  <  0.25  .  (50) 
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Suppose  that  o  =  2  and  50%  of  the  actuation  authority  is  to  be  used  for  disturbance  rejec  ion. 
This  specification  requires  that  a  stabilizing  C  satisfies 

\\CPiI  +  CP)-^\U  <  4  ,  (31) 

where  ||  •  ||i  denotes  the  £oo-Sain.  By  generating  the  look-up  taoles  so  that  ||it|loo  <  1  ,  using 
(30)  and  (31)  ,  we  obtain 

Nloo  <  2  . 

Hence  we  have  a  conservative  design  since  the  SATa  block  acts  as  the  identity  map  over  the 
operation  region.  Note  that  the  I’-tracking  portion  (i.e.,  the  linear  program  solutions  and 
determining  the  signal  generator)  is  a  totally  separate  procedure  from  the  compensator  design 
once  the  actuation  authorities  are  allocated  for  fast  T-tracking  and  disturbance  rejection, 
respectively, 

A  stabilizing  compensator  C  is  designed  for  which  (31)  holds.  We  now  use  the  (u,  y)  pair 
in  Figure  9  to  illustrate  the  obvious  advantage  of  a  closed-loop  design.  A  disturbance  signal 
satisfying  (30)  is  shown  in  Figure  13  ;  din  has  a  DC  component,  the  signal  after  10  s  is  a  sum 
of  four  sinusoids  at  0.1  ,  0.5  ,  1  and  1.5  Hz  ,  the  last  three  at  the  modal  frequencies  of  P  . 
Figure  13  also  shows  the  open-loop  implementation;  P  *  {u  +  din)  introduces  a  considerable 
tracking  error. 

Figure  14  shows  the  closed-loop  signals  u  in  y  (see  Figure  12  ;  u  ,  y  and  arc  as 
mentioned  above)  ,  The  tracking  error  {y  —  y)  is  shown  in  Figure  15  . 


P*{u_hat+djn) 
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Figure  14:  u  and  y  in  Figure  12  {C  stabilizing  for  ]|u||oo  <  2  ;  C  satislics  (Ill)  ;  and 

P  as  in  Figure  13  ;  ?/  Pii  ) 
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Adaptive  Proximate  Time-Optimal 
Servomechanisms:  Continuous  Time  Case 


M.  L.  Workman^  R.  L.  Kosut^*^,  G.  F.  Franklin^ 


Information  Systems  Laboratory 
Stanford  University,  Stanford,  CA  94305 


Aislracl — A  Proximate  Tiinc~Optin)al  Servo  (PTOS)  is 
developed,  along  with  conditions  iibr  its  stability.  An  al¬ 
gorithm  is  proposed  for  adapting  the  PTOS  (APTOS)  to 
improve  performance  in  the  face  of  uncertain  plant  pa¬ 
rameters.  Under  ideal  conditions  APTOS  is  shown  to 
be  uniformly  asymptotically  stable.  Simulation  results 
demonstrate  the  predicted  performance. 

1.  Introduction 

In  many  automatic  control  systems,  it  is  desirable  to  effect  a 
minimum-time  response  to  sot  point  changes.  The  time-optima/ 
Control  is  a  non-linear  function  of  the  plant  states  and  requires 
precise  knowledge  of  the  plant  model  (!].  An  excellent  compilation 
of  related  work  is  contained  in  Oldcnburger  (2],  When  the  plant  is 
not  precisely  known,  the  time-optimal  control  law  caji  be  adapted 
to  clianges  in  the  plant.  In  this  paper  we  will  desa'ibe  an  adaptive 
proximate  time-optimal  controller  whicli  aside  from  being  af  laptive, 
is  more  practical  than  the  ideal  time-optimal  controller. 

In  the  following  section  a  non-linear  controller  is  proposed  wliich 
is  more  practical  than  the  tiine-optitnal  controller,  and  as  will  be 
shown  in  a  later  section,  can  be  very  close  to  time-optimal.  In  the 
tliird  section,  a  theorem  is  given  which  guarantees  stability  of  the 
practical  time-optimal  controller  under  reasonable  constraints.  A 
specific  controller  is  then  shown  to  me^-(  the  stability  constraints, 
and  is  also  shown  to  be  close  to  the  timc-o]>tiiital  solution  under 
reasonable  assumptions.  Adaptation  of  this  proposed  cotitrollcr 
is  the  topic  of  the  following  section,  including  a  theorem  which 
has  been  proved  showing  the  uniform  asymptotic  stability  of  the 
adaptive  non-linear  controller.  Finally,  the  last  section  illustrates 
via  simulation  the  improved  performajicc  afforded  by  the  ad.aptive 
non-linear  controller  over  the  fixed  parameter  iion-linear  controller. 

2.  Control  Design:  Known  Parameters 

Consider  the  plant  shown  in  Figure  2.1,  a  double  integrator 
driven  by  a  limiter  or  saturation  block.  The  equations  describ¬ 
ing  tills  system  arc 

if  ~  a  sat(u)  (2.1 ) 
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II 

(2.2) 

where  the  “sat"  function  is  defined  as 

.  f  +1  . 

z  >  1 

sat(z)  =  ^  , 

Pl<  1 

(2.3) 

1  -1  . 

2  <  -1 

Tlie  system  state  x  is  defined  as 

'T'  A  ,  , 

^  =  [y  v]  (2.4) 

For  time-optimal  control,  the  objective  is  to  minimize  the  time 
required  to  transfer  the  system  from  an  initial  state  (j/o  tJto]  to 
a  final  state  (r  O],  where  r  is  a  coastant  output  reference  or  set 
point.  The  lime-optimal  control  (l)  for  the  system  given  by  (2.1) 


Figure  S.l:  Double  integrator  plant  with  bounded  control, 
and  (2.2)  is 


u  -  Sgn(/, o(yc)  -  u) 

(2.5) 

JtoiVc)  =-  Kgn(j/tf)(2a 

(2.C) 

A 

ye  =  ^  -  y 

(2.7) 

and  the  signuin  funciion  is  defined  as 

f  +I  ,  z>  \ 

SR"(-)  =  S  0  .  2  =  0 

1  -1  ,  2  <  1 

(2.8) 

*lhc  sal  funciion  of  the  plant  is  imposed  by  some  pliysical  con¬ 
straint  such  as  power  &ui>ply  vol'.age.  Combining  the  control  law 
given  by  (2.5)  and  (2.7)  yields  the  following  description  of  the  tiine- 
optiina]  control  system 

y  -  V 

(2.9) 

ii  =  a  sgii(/,<,(v,)  -  v) 

(2.10) 

Jta(yr)  -  !>gn(y,)(2a  jy.-l)’/''  (2.11) 
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The  control  Uw  given  by  (2.5)-  (2.7)  eavd  shown  in  Figure  2.2, 
although  optimal,  is  not  practical  in  many  cases.  Even  the  small¬ 
est  system  process  or  measurement  noise  will  cause  the  control 
to  “chatter"  between  the  maximum  and  minimum  values  [3].  Re¬ 
moving  the  iniinite  gain  operators  Irom  the  time-optimal  controller 
gives  the  system  a  finite  bandwidth,  and  hence  is  much  more  prac¬ 
tical. 


fijsre  S.i:  Time-optimal  controller  for  double  integrator  plant. 


A  proximate  time-optimal  servonieclianisin  (FTOS)  is  sliowii  in 
Figure  2.3.  Tlie  function  /(■)  is  a  finite  slope^  approximation  to  the 
switcliing  function  /(o(-)  given  by  (2.11).  The  sigmim  function  of 
(2.10)  has  been  replaced  by  the  "sat"  function  which,  together  with 
the  gain  factor  can  be  thought  of  as  a  finite  slope  approximation 
to  the  signum  function.  The  equations  wliieh  define  this  "practical" 
version  of  (2.9)-(2.1l)  are 

V  =  u  (2.12) 

t>  =  a  sal(<.-2(/(y<)  -  v])  (2-13) 

where  the  function  /(•)  is  as  yet  unspecified. 

Without  affecting  the  stability  analysis  (for  step  responses),  wc 
will  drop  r  from  the  equations  and  analj-zc  the  system 

il-v  (2.14) 

ii  =  a  sat(/:j(/(-y)  -  v])  (2.15) 


Figure  2.J;  Proximate  Tiine-0])tinial  controller.  Linear  region  of 
tlic  saturation  function  has  gain  /.’2. 


3.  PTOS  Stability 

III  this  section  we  establish  restrictions  on  the  function  /(■)  and 
the  gain  k2  which  will  guarantee  stability  of  the  system  (2.14)  - 
(2.15)  depicted  in  Figure  2.3  for  step  inputs. 

Tiieoreiii  1  (Step  /nyufs),  77ie  zero  solutiun  vj (2. i ami  (2. 1  a) 
is  gloiaUy  asymptciically  siabU  if  the  following  conditions  hold: 


^The  derivative  of  fto  is  infinite  at  y  —  0. 


At)  akg  >  0. 

AS)  /(O)  =  0. 

AS)  f(z)z>0,V^ytO. 

Ad)  S‘  mdS  =  00. 

AS)  /'(y)  =  df(y)/dy  crisis,  V  y. 

A6)  -a  j^/'(-y)  <  /'(-v)/(-S/)  <  »  -  i^/'(-y)  .  V  y. 

PreoJ.  Dei&ils  of  the  proof  arc  contained  in  (4,5]  and  will  not  be 
repeated  here.  Instead,  an  outline  of  tlic  approach  taken  in  the 
proof  will  he  given.  Tlic  proof  consists  of  three  parts.  Referring  to 
Figure  3.^1  it  is  first  shown  tliat  all  trajectories  originating  outside 
the  region  XJ  C  will  enter  U  in  a  finite  time.  Second,  it  is  shown 
tliat  subject  to  >11  tlirough  ^46  trajectories  in  U  remain  inside  U  . 
Tliird,  it  b  shown  that  there  exists  a  Lyapunov  function  for  the 
system  when  x  6  U.  The  regions  in  state  fip^u:e  correspond  to  the 
condition  of  the  control,  unaaturaiid  (U),  saiuro.tcd  positive  (S+), 
and  saturated  negative  (S_),  and  are  defined  as  follows: 

u  =  {(y.u)  6  R2  ,  /(_J,)  _  ^  <  „  <  /(..y)  +  L'j  (3.16) 

s+  =  {(y,  v)  e  :  *--2(y(-v)  -  v)  >  l}  (3.17) 
S-.=  {(y,u)€R^:  *.-2(y(-y)- V)<  -l}  (3.18) 


Figure  3.4:  Regions  U  ,  S+  ,  and  S_  of  the  phase  plane.  Tlic  solid 
curve  rcj>rescnl5  the  points  (y,  v)  such  that  v  =  n-y). 


3.1  An  Interesting  /(■) 


In  this  Kcclioii  wc  propose  a  function  /(•)  which  meets  t)ic  condi¬ 
tions  of  Thcoreni  1  atid  still  produces  nearly  time-optimal  response 
to  clianges  in  the  position  reference  input  r.  The  following  choice 
for  /(•)  is  easy;  to  motivate:  build  a  function  /  nc^u•  to  fto  sudi 
that  the  resulting  system  trajectory  requires  less  than  the  niaxi- 
inum  acceleration  caj)ability  of  the  system,  a. 


five)  - 


{ 


‘■6'>(y<:)l(2uOr  Ivcl)’/^  - 


for  |ye|<  yi 
for  |ye|>  VI 


(3.19) 


The  linear  portion  of  /(•)  connects  the  two  disjoint  halves  of  the 
iion-Uncar  portion.  To  connect  the  non-lineau*  regions  of  /(•)  sucji 
that  /(•)  remains  continuous,  wo  liave  a  constraint  on  the  gains 
and  k2, 


ki  < 


oaA'2 

2 


(3.20) 
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and  a  Constraint  on  the  size  of  the  linear  region 


— (-V 

2or<x  \k2  J 


<  Vl 


<-m 


aa 

V 


(3.21) 


We  can  choose  as  a  function  of  k2  so  that  /*(•)  is  continuous. 
If  /'  is  continuous,  tlien  the  linear  portion  of  /(■)  is  tangent  to 
the  non-linear  s<)uare  root  portion  at  tlie  point  y  =  V(  and  smooth 
control  results.  Thus  for  continuous  /(•)  and  /'(•),  wc  have 


^2 


and  the  linear  region  is  just 


(3.22) 


y,  =  ^  (3.23) 

K1 

Whatever  the  cltoices  for  ki,  kn,  and  yj  in  (3.19)  (subject  to  (3.20) 
and  (3.21)),  we  must  verify  that  the  conditions  of  Theorem  1  are 
met.  It  is  clear  by  inspection  that  the  first  five  conditions  arc 
met  by  (3.19),  and  we  will  check  condition  A6.  The  derivative 
‘V(y«)/dy<:  is 


/'(Vc)-( 

t.  (2aay«)‘/’  ' 

//'  yields 

/(ye)/'(ye)  =  ( 


Calculating  //'  yields 


0  <  y«  <  yi 
y«  >  Vl 


Ve  <  Vi 
Vc  >  VI 


(3.24) 


(3.25) 


Given  //',  it  is  easy  to  show  that  (3.19)  meets  the  conditions  of 
Theorem  1  if  0  <  o  <  1. 

The  question  wliich  now  arises  is,  how  close  to  time-optimal  is 
the  proposed  system  (/(•)  is  as  given  by  (3.19))7  Consider  the 
limiting  case:  let  — *  oo  and  a  get  arbitrarily  close  to  1.  If  we 
consider  time  to  the  target  as  approximated  by  tin\e  to  the  point 
yi,  and  an  upper  bound  on  the  size  of  r,  then  the  time  to  the 
target  of  our  practical  system  will  approach  from  above  that  of  the 
time-optimal  system.  Tliis  meaiui  that  for  unrestricted  values  of 
k2,  we  can  get  arbitrarily  close  to  the  time-optimal  system.  F'or 
the  system  to  remain  practical  there  are  restrictions  to  the  size  of 
k2,  and  these  restrictions  will  be  addressed  in  Section  3.3. 

In  the  following  analysis,  it  is  helpful  to  define  the  distance  L 
which  a  repositioning  from  ro  to  r  covers; 


1/  =  Ir-  ro|  (3.20) 

For  the  time-optimal  system  let  <o,,(  denote  the  time  it  takes  to 
move  a  distance  L  to  the  new  target  position.  For  a  plant  initi.ally 
at-rest  we  have, 

topt(L)  =  (3.27) 

V“ 

ForPTOS,  extremely  small  moves  (chai\ges  in  set  point)  will  leave 
the  system  state  in  U  ,  and  the  time  to  a  given  distance  (tJlowablc 
enV)r  tolerence)  from  the  set  point  can  be  found  from  a  linear  anal¬ 
ysis.  For  large  set  point  clianges,  values  of  l/kj,  the  linear 

part  of  the  response  will  be  neglected.  Alt  hough  the  PTOS  accel¬ 
erates  as  fast  as  the  time-optimal  one,  the  deceleration  is  slower. 
The  time  elapsed  in  moving  from  point  y  =  a  to  point  y  ~  i  at  a 
velocity  which  can  be  written  as  a  function  of  position  is 

,  1  .  ,  . 
time  Horn  a  to  t  —  j  - ny  (3.28) 

Ja  ‘'(V) 

The  velocity  of  the  PTOS  is  approximated  by 

(2oa  \  sgn(y,)  (3.29) 


dm-ing  deceleration.  Defining  fpion  as  the  practical  controller  move 
time,  we  obtain  the  following  approximation  for  the  move  time  for 
the  practical  system; 

tp,o,  =  -^\/L  (3.30) 

Va  ^/aa 

We  can  define  P  zis  the  percentage  increase  in  move  time  of  the 
practical  controller  as 

P  g  (3  31) 

Substituting  (3.27)  and  (3.30)  into  (3.31)  gives. 

This  is  an  exciting  result,  in  tliat  the  move  time  percentage  increase 
is  not  a  function  of  L,  the  move  length.  Although  reasonable  values 
of  or  are  usually  such  that  cr  €  [.9  .99),  P  is  plotted  for  a  wide  remgc 
of  Or  in  Figure  3.5. 


FigvTt  3,5:  Percentage  increase  P  in  move  time  versus  cr,  Tl\c 
point  marked  on  the  curve  represents  P  =  1 .3  %  for  a  =:  .95. 


3.2  Designing  the  Linear  Controller 

Using  a  pole  placement  method  [6,7],  we  can  pick  the  control 
gains  to  effect  a  desired  closcd-looj)  transfer  function.  When  a  is 
known,  clioosing  ki  and  k2  as 

ki  -  wl/a  (3,33) 


^2  =  2C(iW^i/a 

wiU  yicl<3  the  following  closed-loop  transfer  function 

Y{^)  _  wg 

Ii{s)  +  2tdUlj(S  + 

Of  course  tlic  kj  and  k2  must  meet  the  constraints  (3.20)  and  (3.21 ) 
when  using  (3.19)  as  /(•)  in  the  practical  time-optimal  controller. 
Substituting  (3.33)  and  (3.34)  into  (3,20)  gi  ves  a  lower  hound  on 
the  damping  ratio  as  a  function  of  a: 


(3.34) 

(3.35) 


<d  > 


(3,36) 


The  choice  of  ljj  —  \/k’i  n  is  liOJli  peffonTi;4Iire  2*e]»<ed  of 

practical  concern:  higher  baiidwidlh  gives  faster  decay  of  position 
errors,  but  high  bandwidth  necessitates  that  the  model  of  our  plant 
be  accurate  well  above  Model  uncertajnty  will  be  covered  in 
Section  3.3, 
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3.3  Unmodeled  Dynamics 

The  stability  anej.ysis  of  Section  3  piovidcs  us  with  a  set  of 
design  rules  for  the  double  integrator  system.  However,  it  is  well 
known  that  all  systems  have  additional  dynamics  whicli  arc  usually 
eithei-  neglected  or  unknown. 

Initially,  we  will  not  constrain  the  mimodclcd  pai-t  of  the  plant 
to  be  linear  or  time  invariant.  Stability  of  the  open  loop  unmodelcd 
system  will  be  the  only  constraint.  The  complete  controller  block 
diagram  including  unmodelcd  dynamics  and  an  input  disturbance 
d  is  shown  in  Figure  3.6.  From  Figure  3.6  we  have: 

i)  =  aA{  sat(u)  +  d]  (3.37) 

y  t=  V  (3.38) 

Since  we  are  trying  to  examine  the  elTects  of  the  unmodelcd  dy¬ 
namics  and  disturbances  on  the  control  law  designed  in  Section  2, 
we  have  the  seune  control  law  (taking  A  into  account  wheti  design¬ 
ing  the  control  law  makes  A.  modeled  ,  and  this  is  considered  in 

M): 

u  =  hUi-y)  -  "1  (3.3U) 

The  block  diagram  of  the  complete  system  represented  by  (3.37) 
tlu-ough  (3.39)  is  shown  in  Figure  3.6. 


d(t) 


Figure  3.C:  Complete  servo  system  block  diagram  with  unmodelcd 
dynamics  and  disturbance  input  d. 

hi  the  following  theorem,  we  will  need  to  dolinc  an  operator 
which  gives  the  maximum  inagnitudo  of  a  function  over  .all  possible 
bounded  inputs: 

^^(^{(j})  =  sup  1A{ct)|  (3. .10) 

It  is  convenient  to  define 

A]  A  -  1 

As  will  be  discussed  later,  using  this  operator  (700)  fi>r  detennin* 
ing  peak  amplitudes  is  correct,  but  yields  czirrmcly  con5rrvativ< 
conditions  for  stability.  Tlx*  following  ihooreni  places  liniils  on 
both  the  \inmodcled  dynamics  and  the  disturbance  inputs  which 
are  s-uj]icicni  to  guarantee  stability  of  the  following  syst<'in  (see 
Figure  3.6): 

Thooroiii  2  {.Step  Inputs) .  The  zero  solution  oj  (‘i.ITl )  and  ) 

with  control  (.3.39)  is  globally  stable  xj  the  Jolloxniug  conditions 
hold: 

At)  -AS)  oj  Thcorcni  1,  and 

AC)  I  /'(-y)/(-y)  i<  all  -  'rco(A,  {  .al(u)  -t-  ■/})  -  ||c/|U]  - 
ijl'i-y)  .  Vy 


Proof.  The  proof  follows  the  .s.amc  attack  taken  in  proving  The¬ 
rein  1  and  is  contained  in  [4,5]. 

lutorpretatiou.  Theorem  2  difl'crs  from  Theorem  1  in  tliat  it 
allows  for  bounded  disturbjuiccs  and  unmodeled  dynamics  to  per¬ 
turb  the  double  integrator  model.  To  account  for  the  unmodelcd 
deviations,  condition  AG  discounts  the  available  acceleration  by  a 
conservative  amount;  the  absolute  worst  case  contribution  ol  the 
operator  Ai  and  the  disturbance  d.  The  factor  a  in  (3.19)  is  pre¬ 
cisely  the  discount  factor  wliich  can  be  adjusted  when  'yoo(Ai )  and 
||d|)co  arc  specified.  Thus  we  can  set: 

0  =  1-  ^^(Ai  {  sat(u)  +  d})  -  l|dl|oo  (3-42) 

Thus  condition  A6  can  be  stated  as; 

l/'(-y)/(-y)l<  no  -  “/'(-y)  (3.43) 

*•'2 

To  liavc  moves  which  &rc  close  lo  tiinc-opliiuaj,  tlic  unmodelcd 
dyn^nics  Aiid  disturl^anccs  must  be  small  enough  such  that  a  is 
nearly  one.  say  a  G  (.9,  .99].  In  terms  of  the  disturbances,  the 
condition  given  by  Theorem  2  is  vcr>'  rca.sonablc,  as  the  distur¬ 
bances  forces  sliould  be  much  smaller  than  the  available  force  to 
the  actuator. 

The  problem  with  condition  AC  is  tliat  it  is  too  conservative:  it 
docs  not  even  allow,  for  example,  A  to  be  a  real-pole  of  any  band¬ 
width  or  lime  constajit.  In  fact  the  types  of  uiimodcled  dynamics 
which  yield  reasonable  o’s  ait?  very  limited.  If  wc  consider  the  sys¬ 
tem  after  the  receipt  of  a  step  input,  and  a  linc.'tr  time-invariant 
set  of  unmodelcd  dynamics,  we  can  expand  the  rcingc  of  unmod¬ 
elcd  dynan\ics  for  which  stability  can  still  be  proved  (using  present 
techniques).  This  is  best  summarized  in  the  following  Corollary  to 
Theorem  2: 

tJorollary  2.1  (Step  input  at  t  =:  to  ).  The  system  given  by  Equa¬ 
tions  3.37-3.3$  is  globally  stable  if  it  meets  the  following  condi¬ 
tions: 

AI)  -AS)  of  'Theorem  1, 

AC)  A  is  a  stable,  linear  time  invariant  operator. 

A7)  A]  (.5  =  0)  =  0,  Aj  has  no  dr  gain,  I'his  is  a  formula- 
iion  constxaini.  Deviations  in  erpeeted  gain  arc.  considered 
as  vartalions  in  the  plant  gain  a. 

A3)  (1  -}•  0^2 A/s)“*  is  a  stable  operator 

MO  l/'(-y)y(-y)l<  a[l  -■7oo(A'2[A,/^])||u-l-,i|U-||d|U]- 
~/'(-y),  Vy  .  VO  („ 

Proof.  ^J'he  proof  is  along  the  lines  of  'J'hf'orern  2f  and  is  also 
given  in  ['1,5]-  'I'o  aj/jdv  ihis  coiollaiy  to  the  ie;iJ-pole  case  (A  — 
1  /( ^5  d'  1  ))i  "'c  need  the  peak  gain  of  A  j  /5: 

7cq(Ai /s)  -  700  ( - ^  -  7 

\  T5  -i-  1  / 

No(e  that  (lie  peak  gain  drops  low.ud  /.ero  as  tlu*  time  cfinstanl 
appioarhes  zero.  Setting  d  =  0  and  calrulaf  ing  |j  >'lk'  coiulilioii 
yield-- 

O'  <  1  —  ah■2^  (.'i.'l.')) 

whi<'h  has  the  qualitative  <'har<irtenstics  we  n<*ed:  as  (he  h/indwidth 
of  (he  real  pole  gr<#ws,  the  limit  on  or  diminishes.  'I’lms  for  t  0, 
Or  must  be  less  than  one  as  in  d'lieorem  ].  Hence  we  can  analyze* 
(he  elfects  of  practical  niiinodoled  dynamics  on  the  perfomwiiice 
of  the  system,  mo^inlo  the  iinjnjlse  reponse  of  (lie  nnmodeIe<i  dy¬ 
namics  term.  In  the  case  ol  the  real  pole,  the  perlonnaru  e  will  iKiL 
be  adversely  afiected  beyond  that  calculable  from  (d.*!.'!).  When 
very  under<l«’Un|^ed  unmodeled  dynaini^>,  are  inchjde<l  however,  the 
perfornianee  of  the  system  must  he  evaluated  via  simulation. 
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4.  APTCS 

In  this  section  we  will  derive  an  algoritlim  for  adjusting  the 
non-linear  control  law  of  Section  3.1  to  uncertainty  about,  or  slow 
changes  in,  the  plant  parameter  a.  Figure  4.7  depicts  the  structure 
of  the  system  with  adaptation  of  the  non-linear  control  function 
usir.g  information  identified  from  input/output  data  of  the  plant. 


Figure  f.7;  Structure  of  the  Adaptive  non-linear  Controller 


4.1  The  Identifier 


The  idcntificalion  sclicmc  will  be  bAsed  oil 
metric  model  of  the  plant: 

the  following  para- 

■  y-  a  sa((ij} 

(4.4G) 

Vi'Iiere  a  U  an  urikno>\n  con.stant.  VVe  also  assume  that  a  is  known 
to  be  in  the  interval  A  » 

(4.47) 

The  mea-sured  data  is  {  u  ).  a.id  (y).  Deiiotiii);  the  estimate  of  e. 
as  a,  the  fJlerei  eguciion  entir  is  defintd  as 

4*  t-*;  2  a  4' 

(4.48) 

where  z  aj’d  4’  defined  by 

V-frid 

(■1.4b) 

(4.50) 

wlicrc  u  ,TTp.wj>ciits  llic  LapUrc  traJisform  variable.  The  time  cori- 
fitan'it  Tr  affects  tlie  bandwidili  of  ihformatlon  used  by  the  identifier. 
Mid  should  be  cho{.'‘n  mur!j  smaller  than  tU*  time  constant  defined 
by  the  maximum  “learning’'  rate  for  a. 

To  illustrate  tho  ideas  we  propose  llie  following  sinipk*  identifi¬ 
cation  JKhcinc  (LMS): 

a  ~  y</)c  ('1-&1) 

e  ('1.52) 

wlicic  y  is  a  jjosjLivc  con‘jt/mt  ci/o.scn  to  give  a  reasoriable  Icanii.r\g 
rate,  depending  on  th<'  nmgnitudc  of  system  noise,  and  intended 

rlncz-d  lo^^p  li;»iidv.'}/hh  Tlu-  tuifi.'-tl  o<fii!iatf*  fnr  n  is  if* 

It  is  jx^ss/ldc  to  f.how  [l]  that  under  the  cof?djt}ou 
of  persistant  excitation  tliis  single  p/uanieter  identifier  is  exponen¬ 
tially  convergent  to  tlic  correct  va\1uc  ({ti}  and  {y/}  mc  in  the  mode* 
set). 


Parameter  Value 

"  "  To 

a  0.95 

g  .15 

T,  0.02 

Vd  10 

L  1  or  2 


Tabu  1:  Paramotcr  values  for  siniulatioriSs 

4.2  The  Adaptive  Cot^troller 


Using  the  structure  of  the  control  law  (3.19)  with  a  continuous 

/'(•)  ((3.22),  (3.2.3)),  an  adaptive  version  of  the 

PTOS  control  is 

u  Tz  Dnt(k3(J(ij.,p)  -  v)) 

(4,53) 

with  a  generated  from  (4. 51), (4. 52)  and  v;hcre 

. . .  ,  An  =  vvj/p  . 

-E .54) 

k'j  —  'ICdi^dlP 

'  (4.55) 

VI  =  T- 

.  (4.50) 

\  iK''(y<!)((2flQ’ jy.  1)'^^  -  1/fca)  Ior|ye|>j;j 

(.1.57) 


and  p  is  the  projection  of  d  into  a  known  region: 


niiii  |d-p|  (4.58) 

Hcncc,  if  the  estimate  a  is  in  then  p  =  a.  Otherwise  we  take 
the  nearest  value  in  A.  Tho,  following  theorem  gives  sujTidcnt  con- 
ditiom  for  stability  of  APTOS. 


Thoor<»iii  3  (Paraintitr  Convergence).  ikai 

fo-\T 

>  /f ,  v^r  e  io,</] 


(4.50) 


where  T  and  (i  are  po4iitiv<r  f.cuufanf,^  fndependent  oj  Then  Jor 
r(i)  ~  TO  ,  V(  >  If  =>  (y(/,),v(l),  ii(.'))  — •  (ro.O.O)  (4.00) 


7’lie  proof  of  'J  heoieiu  3  is  convAined 


L.  .SiMi)i.,A'r)o.N'  Examples 

la  this  section  wc  v/ill  exaiiiinc  the  I>erfo^ll^Lnoe  of  the  PXOS  and 
/  f^'rOS  via  sonic  simulation  cxmn;>]cs.  lable  i  >n'  aiiis  the  fi.xed 
p^vameteis  for  all  of  the  simulations,  Pcs|>on»c«  of  the  ideal  time- 
optimal  controller  and  Pl'OS  to  two  set  point  changes,  /y  =  1,  and 
L  =  2  arc  overlayed  in  Pigurc  5.1,  No  noise  was  abided  to  the  shnu- 
h'li  ion  of  (he  (ime-optiinal  controller  to  ilhistrate  the  l>ehavior  of  the 
so-oilled  ideal  system  The  unsaluratcd  PTOS  control  during  dc- 
<:clcia(ioii  (a  0.95)  afTinns  that  r.((o)  G  H  ic(f)  G  XJ  ,  V  f 
Figure  5.2  shows  P'iXlS  ro.s];onses  when  the  phint  gain  o.  is  not 
known,  motivating  the  desire  for  an  adaptive  PTOS.  Note  the  time 
r'*spoi7-se  becomes  slower  with  any  deviation  from  the  tnie  value. 
Adavi^iion  of  the  practioTJ  controller  to  a  value  of  a  =  1  from  an 
iiu<L/{\l  esUnmte  of  /i(0)  ri  1.5  is  shown  fu  Figure  5.3.  7'hc  initial 
estimate  of  a  is  too  large,  caasing  control  saturation  duri.ng  dcccl- 
ei’atif*ii.  Although  the  svsteiii  Ajioears  stcahle.  an  the  macnitude  of 
the  ^f:t  point  change  grows,  the  o*icillatoi-y  behavior  of  the  re.spoiise 
grow.s.  As  a  approaches  a,  the  control  does  not  i.aturate  during  de¬ 
celeration,  ixiid  the  clobed-l<>op  behavior  of  the  system  approaches 
that  of  tfie  ideal  P'l'OrS. 
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,  'S  ’i J  to  IS  20  2S  »  35  « 

IWCaocn^j) 

f'iauTc  S.3:  APTOS  transient  response  when  initial  estimate  oi  a 
is  too  liigh,  tinder  idea.!  conditions.  Note  changing  control  signal 
'^as  /(*,a)  is  adjo.sted.  - 

datioii  v.\s  derived.  It  wa'-  t'iiowii  that  lor  ideal  conditions,  the 


f'isurc  S.l!  Overlay  of  v  ai\d  u  lor  time-optimal  control  and  PI  OS. 


TWSM 


APTOS  vcsp-jiise  time  wo-jld  approach  the  ideal  time-optimal  re- 
spcuix;.  Pol-  the  OVre.  of  limited  input  authority  and  desiraliilitj 
of  last  tvaponsr.,  APTQ.S  lui  out-peridmis  standard  Self-Tuning 
Control  algoritluns. 

Be.oaJ.se  iiopUnienlalion  «sf  tin:  adaptive  non-linear  oontiol  law 
is  extremely  oomt>lex  and  cos'.iy  for  second-order  eases  otltcr  tlian 
the  tloiiblc  integrator,  a  discrete  liine  version  of  APTOS  lias  been 
dcvcloi)cd['l,8],  which  faeiUlalcs  itnplcvnonCation.  ■ 
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Adaptive  Proximate  Time-Optimal 
Servomechanisms:  Discrete  Time  Case 

M.  L.  Workman^  R.  L.  Kosut^-^,  G.  F.  Franklin^ 

Information  Systems  Laboratory 
Stanford  University,  Stanford,  CA  94305 


Attinct —  After  a  brief  review  of  the  continuous  time 
proximate  time-optimal  servomechanism  (PTOS),  a  dis¬ 
crete  time  PTOS  is  developed,  along  with  conditions  for 
its  stability.  An  algorithm  is  proposed  for  adapting  PTOS 
when  the  plant  parameters  are  not  known  yielding  AP- 
TOS  (adaptive  PTOS).  Under  ideal  conditions  APTOS 
is  shown  to  be  uciformally  asymptotically  stable.  Sim¬ 
ulation  results  demonstrate  the  predicted  performance, 
and  experimental  results  validate  the  practicality  of  AP¬ 
TOS. 

1.  I.NTRODUCTION 

In  this  paper,  we  will  develop  a  proximate  time-optimal  ser¬ 
vomechanism  (PTOS)  for  discrete  time  control  ofacontinuous  time 
double  integrator  plant.  After  establishing  stability,  the  PTOS 
system  will  be  extended  to  be  adaptive  (APTOS).  The  topic  of 
this  paper  is  an  extension  of  continuous  time  results  presented  at 
the  1S87  Automatic  Control  Conference,  and  hence  a  brief  review 
of  the  continuous  time  proximate  time-optimal  servomechanism  is 
presented  before  addressing  the  discrete  time  problem.  Both  the 
continuous  and  discrete  time  cases  arc  covered  in  Workman*s  dis¬ 
sertation  [l].  In  the  dissertation,  the  time-optimal  trajectory  was 
calculated  for  a  plant  modeled  by  a  real  pole  followed  by  an  integra¬ 
tor.  The  complexity  of  the  resulting  lime-optimal  trajectory,  along 
with  the  fact  that  it  is  a  transcendental  function  in  velocity  for  po¬ 
sition,  motivates  the  Ireuisilion  to  a  discrete  time  implementation 
of  the  controllei.  Although  the  discrete  time  system  is  at  least  as 
complex  as  the  continuous  time  system,  the  implementation  with 
microprocessor  teclvnology  is  much  simpler  and  less  costly  than  the 
analog  circuits  required  for  the  continuous  lime  case. 

2.  Review;  Continuous  Time  PTOS 

Consider  tin;  time-optimal  control  system  shown  in  Figure  2.1. 
The  plant  consists  of  a  double  integrator  driven  by  a  limiter  or 
saturation  block.  The  equations  describing  this  system  ai’e(2] 

i/  V  (2.1) 
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^Rescardi  siipport  for  bfjth  Df.  KosiU  .md  Dr.  Fiaiiklin  from 
NASA  Gr.u.i  NACi-^-.j.yj.  VJSP  (JfC.nt 


V  =  a  sgn(/co(y£)  -  v)  (2.2) 

=  sgn(y^)(2a  (2,3) 

where 

A 

yc  =  T-  -  V  (2.4) 

In  lliis  form,  it  is  helpful  to  defme  the  move  length  L  as  the  position 
error  at  the  reciept  of  a  new  reference  (step)  input; 

L  =  r(to)  -  y(lo)  =  ye(<o)  (2.5) 

The  control  law  given  by  (2.2)-  (2.3)  and  shown  in  Figure  2.1. 
although  time-optimal,  is  not.  practical  in  many  cases.  Even  the 
smallest  system  process  or  measurement  noise  will  cause  the  con¬ 
trol  to  “chatter"  between  the  maximum  and  minimum  values  [3) 
Removing  the  infinite  gain  operators  from  the  time-optima!  con¬ 
troller  gives  the  system  a  finite  bandwidth,  and  hence  is  much  more 
practical. 


Figure  £.1:  Time-optimal  controller  for  double  integrator  pl,ani. 

A  [iroximatc  time-optimal  servomechanism  (PTOS).  is  shown  in 
Figure  2.2.  The  function /(•)  is  a  finite  slope^  approximation  to  ili.- 
switching  function  Jto(  )  given  by  (2.3).  Tlie  signuiii  function  of 
(2.2)  has  been  replaced  by  the  "s.al"  function  which,  together  wiili 
the  gain  factor  hj,  can  be  llioiigiil  of  as  a  finite  slope  approximation 
to  the  .signum  function.  The  equations  which  define  this  “prac  ticai  " 
version  of  (2.I)-(2.3)  are 

y  =  V  (2.(.) 

V  =  a  r.at(/,v,[/(y..)  -  i/])  ('.’.7  ) 

where  tile  lunclion  /(■)  is  .as  yet  unspecified. 

The  following  llicorem  gives  the  lestiictioas  on  the  fiiiiclion  /(■) 
and  the  gain  which  wil]  guarantee  stability  of  the  system  (2.G) 
(2.7)  depicted  in  I'iguie  2.2  for  step  inpiKs, 


'''The  derivative  o(  n.  inlinitr  at  y  -  0. 


Theocetn  1  (Step  Inp%it).  The  zero  solution  0}  (2.6)  and  (S.7) 
is  flohally  asymptotically  stable  if  the  follouiiny  conditions  hold; 

Al)  ak2  >  0. 

AS)  /(O)  =  0. 

A3)  f(z)z  >0.V*7t0. 

A4)  lim._oo  f‘  J{S)  dS  =  00. 

/'(y)  =  <^/(y)/<^y  exists,  V  y. 

A6)  -a+  i^/'(-y)  <  -/'{-y)/{-y)  <  a  -  jL/'(-y)  .  V  y. 


Th«  proof  of  this  theorem  is  rather  long  and  is  contained  in  [l]. 

A  function /(•)  wliich  meets  the  conditions  of  Theorem  1  is  given 
by  (2.8). 


/(V.)  = 


{ 


»Sn(y«)((2aor  |y.  |)‘/^ 


for  |y<|<  yi 

for  |y«l>  vt 


(2.8) 


The  positive  factor  o  is  referred  to  as  tiie  accelei  ation  discount  fac¬ 
tor,  and  is  less  than  one  (0  <  or  <  1).  Equation  (2.8)  is  composed 
of  a  linear  region  (|  y.  |<  y()  and  a  nonlinear  region  which  U  a 
fairly  close  approxinxation  to  the  time-optimal  switching  function 
/lo-  The  linear  portion  of  the  curve  connects  the  two  disjoint  halves 
of  the  non-linear  portion.  To  connect  the  nonlinear  regions  of  /(•) 
such  tliat  /(•)  and  /'(•)  remain  continuous,  we  have  a  constraint 
on  the  gains  ki,  kj,  and  the  size  of  tlie  linear  region  yj: 


and  the  linear  region  is  just 


(2.9) 


VI-  (2.10) 

By  approziinaling  the  positioning  time  as  the  time  it  takes  the 
position  error  to  be  within  the  linear  region,  we  can  define  a  per¬ 
centage  increase  P  in  respon.se  time  of  a  PTOS  system  over  that 
of  a  minimum  time  control  of  the  same  pituit. 


P  =  iOif/o 


2  \  \/ci 


) 


(2.11) 


Note  that  tl»e  value  of  P  is  indep/ridrnt  of  the  si/e  of  the  step  input 
(move  length  L).  Although  reasonable  values  of  a  airr  usually  such 
that  o  6  (-9  .99],  is  j^Iotled  Uit  a  wide  r/uige  of  a  in  I^igurc  2,3. 

A  siniuiation  of  both  the  niininitiiii-tinie  controller  and  the 
PIOS  system  when  the  jylaju  gajn  a  is  known  in  shown  in  Fig¬ 
ure  2.  1.  l  or  the  case  shown,  th''  F  I  GS  system  is  1.3%  slower,  but 
has  much  improved  control  behavior  dining  lef^ulatioii  mode.  In 
additmr.  to  tlic  !i;i«  oi  i « it  lut*!  been  sln^wu  [i]  tiiai 


h'U'  l.,^ttcr  iol)Ustn<*ss  properties  i<>  small  changes  in  a, 
uiimodcled  dynamics,  am*  <],istui  l>;Ln<  es  acting  on  the  plant. 


Figure  t.S:  Percentage  increase  in  move  time  P  versus  ex.  The 
point  marked  on  the  curve  represents  P  =  1.3  %  for  a  =  ,95. 


Figure  2.4:  Overlay  of  y  and  u  for  iinic-opliinal  control  and  PTOS. 


Although  PIOS  is  robust  to  deviations  in  the  model,  perfor¬ 
mance  improvements  can  be  achieved  by  adapting  PTOS  ( APTOS) 
lo  changes  in  the  plant  [],A].  The  continuous  lime  APTOS.  applied 
lo  the  double  integrator  plfiJit,  performed  verv  well.  However,  al* 
though  it  is  possible  to  extend  APTOS  to  other  plants  sucli  as  a 
real-pole  followed  by  an  integrator,  the  resulting  velocilv  trajectory' 
/(')  becomes  unwieldy  to  adaptively  adjust,  7  i\is  implementation 
difficulty  can  be  overcome  by  implementation  of  a  discrete  time 
version  of  APTOS.  a.s  the  microcode  for  the  cfpiatiorus  or  look-up 
table  entries  for  /{*)  can  be  modified  very  easily  in  a  microproces¬ 
sor  or  sigii/ij  processor  based  system.  All  we  nee<l  is  the  theory  to 
piuceed. 

3.  Di.s(;reth  'J'i.mi;  Plant  Model 

Consider  a  doiibk'  itUegraf  or  pl.ant  driven  bv  a  Zf  ff>-Of  fler  hohl. 
As  in  the  rorninufius  time  rase  the  st.ites  ;uc  fJefined  as  position 
and  velocity.  With  an  insignificant  calculation  flelay  we  have  the 
following  disrieie  lime  stale  sp/u  e  fP'Sf  /  ij ,i  mn  ij,/*  j>laril: 

= --'  (  V  «'  )'  (■>■>*) 
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figure  4'^^  Discrete  time  proximate  time-optimal  servo  (PTOS). 
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3.1  Key  Approximation 


(3.13) 


Ill  ilie  conCiniioiis  if  .lil  u point  iruo  a  tif 

the  phase  plane  front  ihe  hotiiKhios  of  the  region,  it  is  fs\irlv  olo.u 
that  <any  trAjecloiy  originating  xnstde  the  regiott  ntvisi  be  trappi^i 
inside  the  region.  The  argxnncnt  hinges  on  the  cotuinnity  <if  the 
trajetories;  any  trajectory  originating  «»n  the  inside  of  a  region 
must  pass  tlirough  a  point  on  the  boundiy  to  leave  tlie  region,  and 
since  all  trajectories  originating  on  points  of  the  boundaries  go  to 
the  interior  of  the  region. 

Such  is  not  the  case  for  diso*ctc  time  aystcnis.  All  we  have  arc 
points^  not  continuous  trajectories.  I'hus  the  argument  used  in  the 
continous  lime  case  fails:  we  do  not  liave  to  have  a  point  on  the 
boondry  to  exit  a  region.  Notwitlistanding  the  diflicultics,  it  is 
still  possible  to  prove  stability  of  our  discrete  time  practical  time* 
optimal  controller.  The  difltculty  will  come  when  wc  try  to  clieck 
the  conditions  for  stability,  But  we're  getting  ahead  of  ourselves, 
first  let  us  state  a  theorem  for  stability. 


The  ideal  time-optimal  control  of  a  discrete  time  system  is  in 
general  very  complicated.  A  valuable  fact  however,  permits  us 
to  extend  the  strategy  used  in  continuous  PTOS  (4)  to  discrete 
time  systems:  as  the  sampling  rate  increases,  the  discrete  control 
tends  to  look  identical  to  a  sampled  continuous  time  control.  This 
does  not  mean  that  the  sampling  rate  must  be  necessarily  higher 
than  would  otherwise  be  selected  to  maice  the  approximation  valid. 
By  i\djusting  the  acceleration  discount  factor  a,  PTOS  will  leave 
enough  unsaturated  control  margin  while  trajectory  following  that 
diiTer<  nces  between  continuous  time  and  discrete  time  optinval  con¬ 
trol  strategies  can  be  absorbed. 


Theorem  2  (5<cp  Inputs).  The  zero  solution  of  the  system  given 
hy  (S.lSj  through  (S.JS)  u  globally  asymptotically  uniforvrily  stable 
if  the  folloufing  conditions  hold; 

-4  / ^  0  <  ok-qT*  <  2.  Note  the  upper  limit  in  contrast  to  the 
continuous  time  case. 

As)  -AS)  of  Thtorem  I 

<  /(-(y  +  Ay))  -  (v+  Av)  <  ,  V  (y,y)  e  u.  u 

corresponds  to  the  region  in  JMIe  space  where  the  control  is 
unsaturated. 


4.  Control  Design:  Known  Parameters 

The  proposed  control  structure  is  a  discrete  time  mapping  of 
the  same  proximate  time-optimal  control  law  as  iLsed  in  continuous 
time  PTOS  in  Section  2  with  some  sliglitly  different  condition.^  on 
the  function  /(-)  wliicli  will  be  discussed  later.  The  mapped  control 
law  is  then; 

u(A.*)=  sat(;c2(/(k.(A.))-t;(A)l)  (4.14) 

Note  that  the  definition  of  is  the  same  as  in  the  continuous  time 
case.  Dropping  the  sample  or  time  index,  we  liave: 

ti  =  sat(l;2(/(ytf)  -  1/])  (4.15) 

To  simplify  the  block  diagrams,  we  will  utilize  the  trausfer  cii.irac- 
tcrislic  of  tlic  D/A  convci-tcr  in  place  of  the  saturation  function,  as 
it  is  equivalent  to  the  two  tran.sfcr  characteristic.s  in  seiie?i  as  long 
as  the  smallest  saturation  alue  is  ij.s«-d.  Quantization  in  the  D/A 
and  A/D  converters  will  be  a.ssunie<i  negligible.  A  block  diagram 
of  this  system  is  shown  in  h  igtir^*  4..'j. 

5.  PTOS  S'l'AniLn"!' 


•■17;  i;'(y)i<  ,  vy, 

where  Ay  and  Av  are  defined  as  follows: 

ClT? 

Ay  -  7'iU+  -  u))  (.'i.lfl) 

Aw  =  a7’,  y)  -  v])  (5.20) 

Troof.  'I'he  proof  of  thi.s  theorem  is  along  the  lines  of  the  proof 
of  the  continuous  lime  proof  of  Theorem  1  and  the  details  arc  also 
contained  in  (4).  The  approacli  to  the  proof  is  useful  in  undcr- 
st.uiding  the  origin  </f  the  con.sii.iints  of  Tlu'^jicin  2.  Hefeiring  to 
I'igiir^  .5.0: 

1.  Show  ihai  the  .system  Mate  will  always  enter  a  icgimi  U  in 
si.iie  sprter  wluM’e  the  <:<»ni.n>l  is  \in.saiiJt  aied  (t:t>rKljli<in.s  Al, 
.47),  aji<l  llu’ji 

2.  Shf>w  th.it  onr<?  the  syst^on  stale  is  in  TJ  .  it  wilf  remain  in  U 
(condi<h»n.s  AO,  .47), 


In  terms  of  the  coiitinuot^s  time  units,  a  map  of  the  continuous 
time  controller  into  the  disacte  time  system  yields: 

y(A-4  1)“  y{k)  l\v{k)  -k-  aiT'l  l'l]u{k)  {.5.Ui) 

1)  :=  w(k)  +  a7,u(A)  (5.17) 

XL[k]  sat(/:,(/(r  -  y(A:})  l-(;.)1)  (.',.18) 

hor  what  values  of  k-^,  <‘uk1  I'i  is  tlw*  sysf/Tii  gis'eii  l,v  (.'i.lf,) 
through  (5.18)  stable  ?  Unfortnir/itel y  the  answei  lo  this  <pies(ion  is 
more  coinplex  than  it  is  in  the  coiitinuoiis  tune  case,  d'lie  increased 
complexity  is  ^lii<*  to  (he  lack  ol  continuous  t raj'''  t<jri<*s  in  tlie  stat/* 
spa<  e.  and  to  tlie  <unJytiraI  r<>inplexjty  of  disciete  time  byapinn^v 
functions. 


l  inally,  show  that  theie  is  a  Lyapunov  furK-iion  for  the  system 
111  the  r^’gion  U  (coiwiitions  41.  .42,  Ad.  A4,  .4.5.  A7). 

Il-oniarks.  Condition  ,41  djfi<*ri>  fimn  the  continuous  cyasc  in 
that  i\  jjuls  an  uj»;»er  hmH  on  the  value  of  akj']\.  7  his  limit  ef- 
fectivrly  puis  a  lower  limit  f.»n  the  r.;ii<^  of  s.inipiing  fie<ju<-ncy  la 
<le*.lie<l  Yelo«ity  loop  !;;ind  wir  1 1  h  If  k  is  cl'njseii  mi<  It  (hat  (he 
him  tioii  /{•)  IS  eiintinuoiis. 


an'l  U]n»n  7iiibsti(iiliMr,  ibis  intfj  coinljtion  41  w<‘  <jbtaiii 

A,  7'/  1 
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Figure  S.6:  Regioiu  U  ,  ,  and  S_  of  the  ph&se  plane.  The  solid 

curve  represents  the  points  (y,  v)  such  that  v  =  /(— y)* 

But  this  is  less  restrictive  than  the  constraint  imposed  on  by 

condition  A7  as  we  shall  see  later  in  Section  7. 

Condition  w46  is  the  most  difficult  to  analyze.  It  simply  states 
that  at  no  point  in  U  can  the  next  control  be  of  magnitude  greater 
than  or  equal  to  one.  It  is  difficult  to  determine  what  this  means 
directly  in  terms  of  restrictions  on  the  function  /(*)•  It  woold  help 
if  we  knew  whether  condition  approached  the  similar  condition 
for  the  continuous  time  case  as  the  sampling  time  approached  zero. 

6.  Stability  in  the  Limit 

Wc  would  expect  tKat  condition  A6  Theorem  2  would  approacli 
that  of  Theorem  1  in  the  limit  as  T,  — •  0.  To  verify  tliis  expec¬ 
tation,  we  will  substitute  the  plant  equations  for  Ay  and  An  into 
condition  A6  of  Theorem  2  and  find  the  Limits. 

To  begin,  we  will  assume  that  we  can  examine  the  condition  AC 
on  the  boimdaries  of  U  .  For  high  enough  sampling  rates  there  will 
be  points  in  the  neighborhood  of  the  boundaries,  and  by  continuity 
of  /(•)  we  know  that  conditions  of  positiveness  and  negativeness  of 
the  control  will  apply  in  those  neighborlioods. 

Condition  AG  of  Theorem  2  is: 

-  -L  <  /(-(y  +  Ay)  -  (u  +  Au)  <  (6.21) 

A'2  A'2 

with  Ay  and  A  V  given  by  (5.19)  and  (5.20).  On  tlic  positive  bound¬ 
ary  of  U  (u  =  1) 

Substituting  Ay  and  Au  on  the  positive  boundary  of  TJ  (u  =  1 ) 
into  (6.21)  and  taking  the  limit  of  all  three  terms  a.s  T,  — »  0  yielcb 
the  following: 

“Oo<(  /(-y)^  “]/'{  y)  <  <1  (6.22) 

and  thus 

-  CO  <  -/(-y)/'(-y)  <  a  -  (0.23) 

«2 

Checking  the  boundary  where  u  =  —  1  yields  the  following  inequal¬ 
ity; 

~  a  -f  <  /(-y)/'(-y)  <  oo  (0.24) 

h 

These  two  inequalities  together  yield  condition  >46  of  Tlieorern  1, 
namely: 

-  a  -H  ~/'(-y)  <  ~J{~y)J'i-y)  <  a  -  (6.26) 

Thus  the  constraint  .46  of  Theorem  2  apj)ro<tches  the  simila;  con- 
sraint  for  the  continuous  time  system  as  T,  -•  0.  7  his  is  a  pleasing, 
result,  especially  in  ligiu  of  the  complexity  of  the  discrete  Cirne 
analysis.  Besides  providing  a  validation  of  the  corulition  in  dis 
Crete  time,  the  preceeding  ^ulalysis  also  leads  to  tlie  rnnclusioii 


that  for  lugh  enough  sampling  rates,  the  function  /(•)  chosen  for 
the  continuous  time  system  should  meet  the  conditions  nccess.iry 
for  stability  set  forth  in  Theorem  2. 

Thus  the  velocity  trajectory  function  /(•)  used  in  the  continuous 
lime  PTOS  (2.8)  will  be  used  in  the  discrete  tunc  PTOS  as  well, 
along  with  the  conditions  for  a  continuous  magnitude  and  slope 
(2.9)-(2.10). 

6.1  Evaluation  of  condition  A6 

Choosing  /(■)  and  /'(-)  to  be  continuous  functions,  of  the  form 
(2.8),  has  a  benefit  in  that  it  allows  us  to  check  condition  AG  on  the 
boundaries  of  tl  (u*  =  1  and  u*  =  — l)(ll  *s  shown  in  Figure  ?7. 
This  is  easier  than  evaluating  AG  for  all  points  in  U  .  Evaluating 
AG  on  the  boundary  u*-  =  -f- 1  yields; 

—  1  <  ki[-f(y-T,f(y)-T,/k2+aT^)  +  /{y)  +  - - aT,]  <  1  ,  Vy 

*2 

(6.26) 

Checking  the  remaining  boundary  (u  =  —1),  yields: 

-I  <it2[-/(y-T./(v)-t-T./*:2-ari‘)-(-/(y)-.^-l-ar.]<  1  ,  Vy 

(G.27) 

Note  that  if  the  function  /(•)  is  symmetric  the  conditions  need  only 
be  checked  for  non-negative  values  of  y. 

The  conditions  pven  by  (6.26)  and  (6.27)  are  diflicult  to  analyze 
directly  (substituting  (2.8)  in  for  /(•))■  Irtstead,  a  numerical  anal¬ 
ysis  will  be  used  to  shed  some  light  on  exactly  what  the  tradeoffs 
are  in  choosing  T,,  and  a  given  a. 

Before  a  numerical  analysis  of  the  above  conditions  can  be  per¬ 
formed,  a  design  rule  must  be  chosen  for  tlie  free  parameters  in 
the  control  law,  namely  the  gain  fcj  and  the  deceleration  discount 
factor  a. 

7.  Linear  Control  Law 

Designing  the  linear  controller  for  the  discrete  lime  system  is 
again  more  complicated  than  the  continuous  time  case.  Let’s  define 
the  ratio  of  szunpling  frequency  to  desired  bandwidth  as  A' 

(7.28) 

Id 

For  most  discrete  lime  servomecliajiisms,  the  value  of  N 
greater  tlian  five.  Astrom  and  Wittemnark  [5]  claim  a  good  i*ulc 
of  thumb  is  to  have 


whichfoiX  —  .707  meaivs /V  5^14.  For  values  of  jV  as  large  as  these, 
one  would  expert  that  the  discrete  time  system  gain  should  be 
close  to  that  of  the  continuous  time  case  for  the  same  desired  closed 
loop  Iwindwidlh.  Unfortunately,  although  the  damping  ratio’s  arc 
very  close,  the  bandwidths  of  the  two  systein-s  aic  significantly 
difTcrent  when  yV  is  as  large  as  10  as  seen  in  Figure  7.7. 

Condition  A7  of  Theorem  2  put  a  restriction  on  the  slope  of  the 
function  /(•).  I>et 

c=  isup|/'(yll= 

2  y  2  k2 

Since  condjlioti  AT  is  that  c  <  1/-I.  it  folhjws  that. 


SubsCittiting  for  yieifis: 

<  -  (7. .31) 
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Fifure  7.7:  Closed  loop  bandwidth  and  damping  aa  a  function  of 
the  gain  ki ,  for  a  sampling  interveJ  of  1  second.  The  bandwidth 
and  damping  of  a  continuous  system  is  shown  by  the  dashed  line 
for  comparison. 


which  is  larger  than  the  result  given  by  condition  .41. 

A  much  more  difficult  task  is  that  of  checking  condition  46. 
Since  substituting  the  expression  given  by  (2.8)  for  /(•)  leaves  us 
with  something  which  appears  to  be  analytically  intractable,  we 
will  resort  to  checldng  the  condition  numerically.  A  program  writ¬ 
ten  in  Pascal  searched  for  the  /arrest  sampling  interval  Tj  whicli 
would  allow  condition  46  to  remain  true,  as  a  function  of  a  range 
of  given  ki’*.  From  ky  and  fcj,  the  closed  loop  (-3  dB)  bandwidth 
of  the  system  was  calculated  as  follows.  Given  ky,  calculate 


ka  = 

(^) 

1/3 

(7.32) 

Pi  = 

akyTj 

2 

+  akjT.  -  2 

(7.33) 

P2  - 

akiTf 

2 

—  afcjT,  1 

(7.34) 

r  = 

(7.35) 

the  closed  loop  bandwidth  can  be  found  from  (given  the  system 
has  complex  poles): 

1  2 


(7,36) 


A  plot  of  tho  minimum  N  An<i  maximum.  fcjr=  versus  or  is  shown 
in  Figure  7.8.  The 

In  summary,  the  desired  bandwidth  of  the  closed  loop  controller 
is  not  significantly  restricted  by  condition  47  of  Theorem  2,  but 
|is  restricted  by  condition  46.  In  other  words,  the  Lyapunov  func¬ 
tion  is  not  placing  an  active  constraint  on  N,  but  the  condition  of 
remaining  in  the  region  U  once  in  it,  or  avoiding  jumping  over  U 
entirely,  docs  place  an  ariive  constraint  on  the  sampling  rate  versus 
desired  bandwidth.  Tlii.s  is  evident  in  Figure  7.9.  The  constraint 
on  Af  could  be  avoided  by  changing  the  function  /(•)  to  account  for 
sample  lime,  but  only  at  the  cost  of  loss  of  perfonnance  (increase 
|in  response  time). 

8.  Unmodeled  Dynamics 

TAlfing  M.nmoddcd  dyraomics  into  account  in  Theorem  2  is, 
vithin  the  present  analytical  framework,  impossible.  Since  ana- 
l^iing  condition  AG  of  Theorem  2  explicitly  is  not  possible,  it  is 


or 


Figure  7sS:  The  maximum  value  of  and  minimum  value  of  N 

versus  or.  As  design  guides,  these  two  curves  could  be  considered 
constants:  N  >  6.2,  and  k\Tj  <  .49. 


Figure  7.9:  The  value  of  k\  7/  as  a  function  of  TV,  for  or  =  .95  and 
the  upper  limits  imposed  by  conditions  A6  and  A7. 


hard  to  imagine  that  tlirowing  in  more  complexity  will  help  in  the 
analysis.  Thus,  unfortunately  we  will  have  to  fall  back  on  Section  6 
and  make  the  assumption  that  for  fast  enough  sampling  rates,  the 
uiunodeled  dynamics  analysis  for  the  continuous  time  case  should 
apply(l].  While  lliis  is  not  particularly  satisfying,  when  coupled 
with  system  sinxulatioiis,  it  is  a  practical  approacli. 

The  design  procedure  for  discrete  time  is  thus  to  apply  continu¬ 
ous  time  measures  of  ujimodcled  dynamics  to  the  system,  chose  an 
appropriate  acceleration  discount  factor  or,  and  verify  tliis  choice 
by  simulation  of  system  pcrfonnaj^ce. 

9.  Adaptive  PTOS 

riie  same  structure  as  used  in  the  continuous  time  APTOS  will 
be  used  for  the  discrete  time  APTOS,  with  appropriate  clianges 
iii  th*  id«  ntir»<-«»t»uii  filgoi  itlun.s:  replace  tiie  plant  gain  a  by  the 
estimated  plant  gain  ain  (2.8),  and  adjust  the  control  gains  k\  and 
k'j  by  a  as  well. 
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9.1  Identifier 


To  fix  ideas,  we  will  propose  and  analyze  a  simple  one  parameter 
identifier  in  the  APTOS  structure.  In  terms  of  peljmcmials  in 
the  unit  delay  operator  9“*,  the  input-output  relationship  of  the 
double  integrator  plant  is 

(9~‘  +9"^)“=  ( - (9-37) 

VanomX?/ 

where  Onem  represents  the  nominal  or  expected  value  of  the  plant 
gain  a.  If  the  measured  data  is  {  u,  y,  t  =  0,  T,,2Tm<  '  ■  -  }  then  we 
can  form  an  equation  error  as  follows: 

t:  =  z-<i>  (9.38) 

where, 

d  =  (q-*  4  9“^)“  (9.40) 

Consider  the  LMS  identifier  for  the  normalized  plant  gain  a: 

e  =  z  —  dd  (9'4l) 

afc+i  =  ak+9d>c  (942) 

where  9  is  a  positive  constant.  Define  the  parameter  error  as 

d  =  donom  —  o  (9.43) 

The  following  Theorem  gives  conditions  for  convergence  of  tlvis 
single  parameter  identifier. 

Theorems  (Parameter  Convergence  // 

1-  S'I>1  €  (0,2) 

2.  37'  >  0,  >  ot  >  0  sued  lAat  Jot  all  k; 

(=T-l 

/?  >  <^^+,  >  « 

iken  the  parameter  error  a  — ►  0  aa  A;  — »  oo  eTponeniiallif  fast. 


Proof.  The  proof  is  contained  in  (1]. 

Thus  as  long  as  condition  2  of  Theorem  3  holds,  adaptive  con^ 
trol  based  on  o  works.  This  is  because  exponential  convergence  of 
d  gaurantees  that  at  some  point  the  function  /(y«,d)  will  meet  the 
conditions  of  Theorem  2  are  met  (a  <  1),  To  gaurantee  condition 
2  of  Theorem  3  holds,  there  must  be  enough  changes  in  the  input 
reference  command  r.  In  implementation,  the  parameter  estimate 
is  no<  updated  when  the  system  is  in  regulation  mode,  wliich  pro- 
vents  parameter  drift  due  to  lack  of  persistant  excitation.  Hence, 
after  a  countable  number  of  input  command  clianges,  the  system 
is  tuned. 

10.  Simulation  and  Experiment 

A  simulation  of  the  adaptive  behavior  of  the  discrete  time  AP* 
TOS  system  is  shown  in  Figvire  10,10.  An  experimental  setup 
using  a  torque  motor  and  an  inertial  load  was  used  to  test  the 
APTOS  system.  The  adaijtivc  behavior  of  the  exi^crimental  sys^ 
teiu  matched  simulation  very  well.  Once  tuned,  tlie  simulation 
and  measured  responses  were  extremely  elose,  as  <’.An  be  seen  in 
Figure  10. 


Figure  10. 10:  APTOS  behavior  to  an  initial  error  in  parameter 
estimate  of  50%. 


<■  .—I  ■,  > 

Figure  lO.II:  E.xijcriment  versas  simulation. 
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Abstract 

An  Extended  Proximate  Time-Optimal  Servomechanism  (XPTOS)  is  developed  for  the  control  of  a  flexible  structure 
with  a  single  structural  mode.  The  resulting  control  system  is  closed-loop  ,  and  embodies  in  its  structure  the  charac¬ 
teristics  of  a  time-optimal  control  law  and  the  fine  tracking  properties  of  a  properly  tuned  linear  regulator.  Simulation 
results  depionstrate  the  performance  of  the  XPTOS,  and  its  robustness  in  the  face  of  uncertain  plant  parameters. 

1  Introduction 

■  '  '  '  ,  ■  .  ■  ' 

One  of  the  challenging  tasks  facing  control  engineers  and  theorists  is  the  design  of  control  systems  to  achieve  rapid 
slewing  and  precision  pointing  of  large,  space  structures  (LSS).  With  increasing  demands  being  placed  on  the  design 
and  construction  of  lightweight  LSS  optical  tracking  systems,  the  need  has  arisen  for  sophisticated  control  algorithms 
that  make  optimal  itse  of  the  ma.cimum  torque  available  for  rapid  slew,  and  achieve  high  tracking  accuracy  after  the 
trackiiig  error  signals  have  become  sufliciently  small.  ' 

The  “ideal”  solution  to  this  problem  is  obtained  by  computing  the  optima!  open-loop  “switching”  actuation  sequence 
that  steers  the  structure  from  initial  to  final  (target)  position  in  minimum  time  [.5,6].  In  practice,  such  a  .solution  is  very 
difficult  to  obtain,  an.i  a  p-oximate  time-optimal  control  law  is  usually  derived  by  considering  the  rigid  body  motion 
only.  The  resulting  input  actuation  sequence  may  result  in  significant  e.\citation  of  the  structural  modes,  and  therefore 
it  must  be  modified  so  that  its  power  spectrum  has  significantly  lower  harmonic  content.  One  approach  is  the  sine- 
versne  torque  shaping  technique,  wliich  attempts  to  achieve  a  good  tradeoff  between  slewing  time  and  .structural  mode 
e.xcitation  (Ij.  The  resulting  control  law,  however,  isopen-  loop  and  therefore  the  control  system  become.s  very  sensitive 
to  modeling  errors  and  c.xogenous  disturbances.  In  order  to  overcome  these  dilliculties,  a  number  of  techniques  have 
been  proposed  which  combine  feedforward  sine-versino  torque  excitation  with  linear  feedback  information  to  provide 
for  active  damping  of  the  structural  modes  (.2,3] . 

In  this  paper,  we  consider  the  de.sign  of  a  practical  “proximate”  time-optimal  feedback  controller  for  a  fie.xible 
structure  with  a  single  structural  mode.  The  controller  blends  the  time-optimal  characteristics  of  a  nonlineai  (switching) 
control  law,  and  the  fine  tracking  properties  of  a  properly  tuned  linear  regulator.  Furthermore,  it  exhibits  good 
robustnc.ss  properties  against  plant  parameter  variations.  The  approach  pursued  here  follows  from  some  previous  work 
reported  in  [8,0,10]  on  a  proximate  time-optimal  servomechanism  (l^TOS)  for  a  rigid  structure,  with  high  frequency 
unrnodcled  dynamics  that  arc  outside  of  the  perfcnnance  bandwidth.  We  extend  tliese.  terliniques  by  taking  directly 
into  account  the  c.vistoncc  of  a  lightly  damped  mode,  which  may  possibly  be  within  ihe  desired  pcrfoimance  bandwidtli. 
The  resulting  control  law,  referred  to  as  the  extended  proximate  time-optimal  servomechanism  (XPTOS),  ret.s'us  tlic 
simplicity  of  the  basic  structure  introduced  in  the  P10.S  and  offers  attractive  advantages  when  compared  \vith  the 
approaches  conventionally  u.sed  for  time-optimal  control  of  large  space  structures. 

The  organization  of  the  paper  is  a.s  foUows.  Section  ‘I  contain.';  a  brief  description  of  a  proximate  timo-oijlimai 
servomechanism  (PTOS)  introduced  in  [S,9j.  Section  3  i.s  devoted  to  tlie  extended  proximate  time-optimal  seivo 
rv  a  t,  rl,u -nilpd  ovjuTnib'  i.b.-tf  illnstratr-.s  the  t;erformancn.  of  tile  control  ab'oritlim  introduced 

in  Section  3.  Finally,  Section  5  contains  the  conclu.sions  and  .suggestions  for  further  research. 


Figure  1:  Time-Optimal  Controller  for  Double  Integrator  Plant 


2  The  Proximate  Time-Optimal  Servo  (PTOS):  A  Review 

This  section  contains  a  brief  review  of  a  proximate  time-optimal  servomechanism  (PTOS)  described  in  Workman, 
Kosut,  and  Franklin  [9]  for  a  double  integrator  plant. 


2.1  The  PTOS  algorithm 


Let  the  plant  P  be  described  by  the  system  of  equations 


y  ~  V  , 

V  ~  au , 


where  v.  and  y  denote  the  plant  input  and  output  respectively,  a  is  the  maximum  acceleration  available,  and  lii(t)l  <  1. 
Let  the  plant  state  x  be  defined  by 

=  [yv]. 


Given  an  initial  state  Xq  ~  [yo  Vq]  at  time  t  =  0,  suppose  it  is  required  to  steer  x'q  to  a  target  state  ==  hO]  in 


minimum  time.  The  resulting  time-optimal  control,  depicted  in  Figure  1,  is  given  by  [6] 

y  =  sgn  (/,o(c)  -  v)  , 

Ao(e)  =  5gn(e)(2a|el)l  , 

e  =  r-y  , 

where 


(1) 

(2) 

(3) 


2>  1 

sgn(^)  =  S  0  , 

z  =  0. 

(4) 

1  -1. 

z  <  1 

The  control  law  given  by  (l)-(4),  although  optimal,  is  not  practical.  In  fact,  any  process  or  measurement  noise  will 
make  the  control  signal  v  “chatter”  between  its  maximum  and  minimum  values,  thus  exciting  unmodeled  dynamics 
that  are  always  present  in  a  more  accurate  model  of  a  real  plant.  Even  in  the  absence  of  exogenous  signals,  the 
time-optimal  controller  lacks  robustness  with  respect  to  neglected  dynamics,  since  a  lightly  damped  mode  will  induce  a 
limit  cycle  (Workman,  [o]).  To  circumvent  this  difliculty,  a  mCi.e  practical  implcmcnta'.iozi  o.  a  ..imc-  cp^.imai  CGuviCiiv.. 
was  introduced  in  [8].  The  resulting  control  system,  referred  to  as  a  pro.ximate  time-optimal  servomechanism  (PTOS), 
is  diagrammed  in  Figure  2. 


64 


Figure  2;  Proximate  Time-Optimal  Servomechanism  (PTOS) 


The  equations  that  describe  this  “practical”  version  of  the  optimal  control  law  are 


where 


and 


y  =  y, 

V  ~  a  sat(l:2[/(e)  —  t»])  ;  Fj  >  0  , 


+1  ,  z  >  1 

^  .  I-I  <  1  , 

— 1  ,  z  <  — 1 


J{c) 


sgn(c)  [(2aajep/2  j 


i^i  <  yi 

|e|  >  yi 


(5) 

(6) 

in 

(8) 


In  (8),  ii,  k2  and  t/j  are  positive  real  constants,  and  0  <  or  <  1,  referred  to  as  the  accelcrniion  discount  factor,  is  a 
tuning  parameter. 


The  fmite-gain  operator  sat(-)  ha.s  replaced  the  infinite  gain  operator  Egn(-),  and  the  switching  function  /(o(e) 
(the  derivative  of  which  is  infinite  at  e  —  0)  has  been  substituted  by  /(e),  which  is  linear  in  the  range  jej  <  yi.  To 
connect  the  nonlinear  parts  of  /(•)  in  (8)  such  that  /(•)  remains  continuous  and  has  a  continuous  first  derivative,  the 
following  additional  constraints  v/ere  imposed  in  [8,9]; 


(9) 


and 


(10) 


Stability  of  the  resulting  dosed-loop  system  (for  step  inputs)  follows  from  the  fact  that  tlie  zero  solution  of 


ii  -  V 

V  ~  a  ;.;<tt  {hVi-y)  ~  v]) 

IS  giobaily  asymptotically  stable  (see  [y,  Theorem  3.1]).  In  addition,  the  cioscd  loop  system  is  dose  to  time-optimal  in 
the  following  sense; 

bet  Xq  ~  [ro,0l  denote  the  jilant  initial  state,  and  suppose  it  is  desired  to  steer  to  [r, (jj  in  minimum  time.  Using 
the  (time-optimaU  cendro!  law  m  (l)-(-1),  the  imnimurn  maneuver  time  topi  is 


where 


X^lr-rol- 

We  now  consider  the  PTOS  system  described  in  (5)-(8).  for  large  set  point  changes  such  that  L  »  the  time 
it  takes  for  the  closed  loop  system  to  settle  in  the  linear  region  of  operation  can  be  approximated  by 

tj(L)  —  — ^>/L4-  — ^  ~L  ,  (12) 

where,  as  shewn  in  [S],  i,{L)  is  such  that  lr-y(t,(L))l  «  ^  .  Defining  P  as  the  percentage  increase  in  maneuver  time 

of  the  practical  controller,  it  follov/s  from  (11)-(12)  that 

. ■  . ' 

- Clearly,  P  is  independent  of  the  move  length  L.  Moreover,  P  is  close  to  0  for  values  of  a  close  to  1.  The  above 

analysis  is’based  on  the  assumption  that,  in  the  linear  region  of  operation,  the  dynamics  of  the  closed-loop  system 
decay  sufficiently  fast  ou  the  time-scale  of  r,(f).  This  can  be  achieved  by  proper  selection  of  k,  and  a,  as  described 

below. 


2.2  Designing  the  Linear  Controller 

Let  H{s)  =  y(s)/ii(s)  denote  the  transfei  function  of  the  closed-loop  system  of  Figure  2,  when  opera.ting  in  the  linear 
region.  Witli  the  usual  notation  for  a  second  order  system,  and  using  (9),  it  follows  that 


"(■')  =  nxT 


S'*  +2CdW<is  +05 


where 


Thus  the  overall  closed-loop  system  bandwidth  wj  and  damping  lactor  (d  «an  be  independently  controlled  by  the 
parameters  ti  and  a,  respectively.  As  shown  in  [8,9],  the  PTOS  algorithm  cm  be  applied  to  tlie  control  of  a  puiely 
rigid  body  with  high  freanency  unmodeleds  dynamics  that  arc  outside  of  the  performance  bandwidth  Wj.  However, 
the  structural  modes  are'within  the  desired  closed  loop  system  bandwidth,  the'i  they  must  be  treated  explicitly  in  the 
initial  design.  This  i.s  the  subject  oi  the  next  section. 


3  The  Extended  PTOS  Algorithrr  'X.PTOS) 

In  this  section  we  address  the  problem  of  designing  a  pro.Minatc  time  optima!  coiitioilcr  for  a  fie.xible  largy;  space 
structure.  In  this  preliminary  study,  we  re.slrict  ourselves  to  t!  e  (Ideali^d)  case  where  tin:  .structure  /■  to  be  contiollec 
is  linear  and  contains  a  single  structural  mode.  Hence,  P  can  be  described  by  the  trati.sfer  function 

-T7Y--~r  -v, 

^  ^  .-A  -r  U', 

where  and  (  denote  the  frerpiency  and  damping  of  iiie  Kiractui.d  m.cd,-  ,  a,,.,:..;.; v.Jy,  aud  a,  and  c,  arc  real  imm;  err. 
We  assume  thh-  botli  output  portion  and  rate  nieanureineiits  me  available  lor  u-casureinent.  In  this  case,  tlie  pl-.mt 

admit.s  the  stale-space  realization 

X  /Lx'dAu, 

y-.u  I 

0(1 


where 


0 

1 

1  0 

0 

r  o  n 

A  = 

0 

0 

i  0 

0 

,  B  = 

o  c 

,  c  = 

'1010' 
0  10  1 

(18) 

0 

0 

1  0 

'  1 

LoJ 

0 

0 

1  -w’ 

1 

-2Cw„  _ 

z=  {y,y),  and  x'^  =  (irr>Vr,I//,  vy)  denotes  the  state-space  vector.  We  shall  henceforth  refer  to  (yr.Vr)  and  {yj,vj) 
as  the  rigid  body  and  structural  mode  coordinates,  respectively.  ■ 

(■ 

3.1  Proximate  Time-Optimal  Control  of  a  Flexible  Structure 

Consider  the  autonomous  linear  process  (17)-(1S).  Given  an  initial  state  lo  at  time  i  =  to,  suppose  it  is  required 
to  steer  xq  to  a  final  (target)  state  z/  in  minimum  time,  by  using  inputs  u  such  that  |u(t)l  <  ].  The  problem  of 
time-optimal  control  thus  defined  has  been  the  subject  of  e.xtcnsive  research,  and  a  vast  body  of  literature  is  available 
on  its  theoretical  and  practical  aspects  (see  Lee  and  Marlais  (5)  and  Oldenburg  (6)  for  a  rigorous  e.xposition).  Under 
fairly  generic  conditions  the  optimal  steering  controller  Uept(i)  e.'dsts  and  is  of  the  relay  type,  i.e.  it  assumes  the  values 
±1  and  switches  a  finite  number  of  times.  A  closed-loop  implementation  of  the  optimal  control  law,  hov/ever,  require.^ 
the  construction  of  high-order  switching  surfaces,  for  systems  with  state-space  dimension  greater  than  two,  these 
surfaces  become  so  comple.x  as  to  e.xclude  their  use  in  the  implementation  of  practical  time-optimal  control  algorithms. 

An  interesting  approach  to  the  solution  of  this  problem  has  been  reported  by  Kalman  (in  Oldenburg  [6]),  who  used 
linear  iran.sformations  in  the  phase  space  to  replace  a  “high-order  sy.scem  with  a  second-order  system  which  closely 
approximates  the  former”.  In  particular,  <3  nearly  optimally  compensated  third- order  saturating  servomechanism  was 
described  for  the  plant.  G(s)  =  ‘■insisting  of  a  pure  inertia  and  a  first-order  lag.  The  resulting  control  law 

re!ie.s  on  a  single  switching  curve,  and  is  effective  if  the  dynamics  of  the  first-order  lag  are  sufficiently  fast.  Clearly,  this 
procedure  is  not  directly  applicable  to  the  fie.xible  structure  described  in  (17)"(18)i  since  the  structural  mode  typically 
has  very  small  damping  <,  and  therefore  a  very  long  settling  time. 

In  Schmidt[7],  a  technique  for  the  design  of  nonlinear  saturating  controllers  to  achieve  proximate  time-optimal 
control  of  high  order  plants  was  described.  In  what  follov/s,  and  motivated  by  the  work  reported  in  (6)  and  (7],  we 
propose  a  (nonlinear)  feedback  sciicme  for  pro.vimate  time-optimal  control  of  (17)-(1S)  that  preserves  the  basic  structure 
ofPTOS. 


3.2  The  structure  of  the  XPTOS 


The  rationale  behind  the  XPTOS  control  syste;.’i  can  be  e.cplained  as  follows;  the  dymarnic  behavior  of  tlie  flexible 
structure  described  in  (17V-(18)  can  be  decomposed  iulo  its  rigid  and  structural  dynamics,  which  are  completely 
specified  by  the  state-space  coordinates  (unU,  )  and  {yj,vj)  respectively.  To  sleer  tin;  structure  from  ■.an  initial  to  a 
final  (target)  position,  apply  the  PTOS  strategy  to  control  the  rigid  body  motion,  and  blend  it  smoothly  into  a  linear 
state-feedback  control  law  that  actively  damps  out  the  structural  modes.  The  re.sulting  scheme  is  depicted  in  Figure  3. 


Let  X  ;=  [yriVriVj  ,v/]  denote  the  cstirrrate  of  x  (y.-,  Vr,  y; ,  vy),  obt.aincd  from  the  state.-c.stimator  dynamics 

X  -  Ax  -f  iJu  -1  Ko[yn>  -  C'i]  ,  (lb) 

where  Ko  i.''.  such  Uiat  {A  —  KqC)  is  ar.yjiipt.otically  stable.  Then,  the  following  description  of  XPTOS  is  obtained: 


X  zr.  At. hsal[u)  , 


(7.0) 


^rri  —  1 


where 


La 


f  {<■■)  i), -- --ij,  -  , 


(7!) 


r  -  Hr  , 


07 


measurement 


Figure  ,?:  Extended  Proximate  Time-Optimal  Servomechanism  (XPTOS) 

/{•)  is  defined  in  (S),  and 

=  (Iti  Ft  ^23) 

is  such  that  (A  —  BK,)  is  asymptotically  stable. 

3.3  XPTOSt  Design  Methodology 

The  purpose  of  this  section  is  to  provide  some  guidelines  for  the  design  of  the  XPTOS  system  to  accomplish  rapid 
slewing  of  the  flexible  structure  described  in  (17)-(18).  In  a  typical  retargeting  maneuver,  the  objective  is  to  steer 
rapidly  the  output  y  from  0  (at  .ime  i  =  0)  to  the  final  target  value  ro,  and  remain  there.  To  accomplish  this,  we 
propose  a  design  methodology  that  is  based  on  the  following  control  strategy:  during  the  initial  phase  of  the  maneuver, 
let  the  nonlinear  characteristics  of  XPTOS  takeover,  in  an  attempt  to  steer  the  rigid  body  dynamics  {vr,  Vr)  from  (0,  oj 
at  t  =  0  to  (ro,0)  in  minimum  time;  in  the  last  phase  of  the  maneuver,  use  the  properties  of  a  properly  tuned  linear 
regulator  to  achieve  high  tracking  accuracy  and  active  damping  of  the  structural  mode. 

It  is  clear  from  the  considerations  above  that  the  following  constraints  (inherited  from  the  PTOS  design  methodolo^'y 
described  in  Section  2)  must  be  observed; 

(Cl)  P,  -  ,  (C2)  0  <  a  <  1. 

In  practice,  the  degree  of  flexibility  of  the  structure  places  a  lower  limit  on  the  size  of  the  maneuver  for  which 
the  ‘Tang-  bang” characleri.stics  of  XPTOS  should  be  explored.  Thus,  given  L,„cn  select  so  that 

(C3) 

^min 

V/c  now  examine  the  coiistr2.int.s  that  arise  from  the  performance  requirements  in  the  linear  region  of  operation. 

Consider  the  XPTOS  .system  diagrammed  in  Figure  3,  and  described  by  equations  (18)-(23).  In  the  linear  region 
of  operation,  and  in  the  ab.sence  of  any  input  signal  r,  the  system  can  be  simply  viewed  as  the  combination  of  a 
state-observer  and  a  linear  regulator.  The  evolution  of  the  feedback  sy.sleni  is  then  governed  by  the  eouatioiir, 

i  Ax  —  JJK,t.,  (2d) 

i  -  (A- KoC  ~  JJK,)z  ,  (25) 

vdiere 

A',  -  [li  ■  (26) 

Furtlieniiore,  the  poles  of  the  intercounecled  sy.stem  consist  of  the  observer  poles  (eigenvalues  of  A  —  KqC),  together 
with  the  regulator  poles  (eigenv.aluM  of  A  —  BKj).  The  first  requirement  on  tlic  system  (2,d)  (2G)  is  that  it  be 
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asymptotically  stable.  In  addition,  Kq  and  K,  must  be  selected  so  that  the  XPTOS  e.xhibit  good  tracking  properties 
(i.e.  fast  dynamics)  in  the  linear  region  of  operation. 


3.3.1  Selection  of  K, 

The  PTOS  design  methodology  detailed  in  section  2.2  reduces  to  the  solution  of  simple  algebraic  equations.  A  similar 
procedure  may  be  derived  for  the  design  of  XPTOS.  In  fact,  it  is  straightforward  to  derive  a.n  e.\pression  for  d(s),  the 
regulator  characteristic  polynomial,  in  terms  of  K,  and  a.  Furthermore,  given  a  and  any  set  S  of  desired  closed-loop 
(comple.x  conjugate)  eigenvalues,  it  is  possible  to  ascert  the  existence  (and  in  the  affirmative  case  compute)  K,  so  that 
K,  satisfies  the  constraints  (Cl)-(C3),  and  the  roots  of  d(s)  equal  E.  This  method  becomes  rather  cumbersome  (and 
therefore  not  practical)  if  extended  to  plants  with  more  than  one  structural  mode.  With  this  objective  in  sight,  we 
describe  a  different  design  method. 

Assuming  the  state  vector  x  is  available  for  measurement,  we  seek  a  linear  control  law  of  the  form 

=  ~K,x  ,  (27) 

where  K,  =  (ki  JL-2  ta  is  suitably  chosen  to  minimize  the  quadratic  performance  index 

•^  =  /  :  (2S) 

Jo 

/?  >  0  and  Q  symmetric,  positive  scmidefmiic 

subject  to 

X  —  Ax  bu  •,  z(0)  —  zo  (29) 

In  the  sequel,  we  will  refer  to  the  problem  described  above  as  the  linear  quadratic  regulator  (LQR)  problem.  Let 
Q  =  z  for  some  tow  vector  2,  and  let  the  pairs  (.A, 6)  and  {A,z)  be  stabiiizable  and  detectable,  respectively.  Then, 
K,  is  obtained  from  the  positive  definite  solution  of  an  algebraic  Riccati  equation,  and  the  resulting  feedback  control 
law  stabilizes  the  system  (29)(Kwakcrnaak  and  Sivan,(4]). 

We  remark  that  for  our  purposes,  the  LQR  design  methodology  is  simply  viewed  as  a  design  tool  that  provides  a 
systematic  way  of  “scanning”  a  large  set  of  stabilizing  feedback  laws.  Thus,  p  and  z  arc  tuning  parameters  that  control 
the  location  of  the  regulator  poles  to  achieve  good  tracking  properties. 

The  inclusion  of  the  con-straints  (Cl)-(C3)  into  the  LQR  design  problem  is  considerably  difiicuU,  and  we  tbeiefore 
propose  the  following  iterative  algoritliin  for  the  de.sign  of  XPTOS: 

Step  1.  Select  an  initial  estimate  for  2,  and  let  p  vary  in  some  interval  [p,n\n,  Pimx]-  Yoi  each  value  of  p,  solve  the  LQR 
problem  (28)-(29)  to  obtain  a  stabilizing  feedback  gain  Kj{p). 

Step  2.  Plot  and  a  2kilk\ar  a.',  functions  of  p,  and  check  that  a  value  of  p  p‘  e.xists  for  which  (CI)-(C3)  are 
satisfied.  In  case  these  constraints  are  not  satisfied,  modify  z  and/or  the  interval  , p,nax]  and  go  back  to 
Step  1. 

Step  3.  Check  the  final  location  of  the  regi;lalor  poles,  (i.e.,  eigenvalues  of  A  -  UK,).  As  a  design  rule,  require 
the  “dominant”  tirne-comstant  of  the  regulator  system  to  be  much  smaller  than  <,,,((L,^iii)  •'  2(L,„i„/ar)’^', 
where  fopi(Lniiii)  the  (theoretical)  niinimum  time  required  for  rctaigeting  of  the  rigid  body  only  (i.e.,  make 
mini?f[A,(A  —  BK,)\  >>  l/<c/;)((Lmiri))-  If  fkis  condition  is  not  met,  modify  2  and/or  the  interval  [pniin.Ptn,-ix], 
and  go  back  to  Step  1. 


Remark  3. 1  In  the  analysis  alawe,  v/e  esche’.v  the  question  of  crcistence  of  a  vector  A',  .such  that  (LT.)  -(CXi)  are  satisfied, 
<3,iiu  K,  solve.',  th.-:  LQR  problem  Rr  soi.u''  rbr.iri-  of  p  and  2. 

Remark  3.2  The  choice  of  the  qu.antitif  s  p  and  z  is  not  straiglilforv/ard,  and  requires  some  insight  into  how  they 

th.e  rekative  magnitude  of  the  gain.'.  I  k-,,  P3  avd  ky_,  ar.  v.-ell  a':  the  locatioii  of  the  T'-giib.o.r  iderable 

insight  into  this  jjroces.i  is  g.iiiied  l)>  examining  the  a.syiiij..totic  behavior  of  tie/  ie;pil,atoi  poles  as  p  -  ♦  0  aie! 
p  —*  00  (Kwakern..rak  and  Siv.ui  [4]j. 

fi!J 


3.3.2  Selection  of  Kq 


Without  any  constraints  imposed  on  its  elements,  Kq  is  simply  obtained  as  the  optimal  solution  to  a  stochastic 
observer  problem  (Kwakernaak  and  Sivan,  (4]).  In  this  case,  the  state  excitation  noise  and  the  measurement  noise 
characteristics  act  as  tuning  parameters  to  force  the  dynamics  of  the  observer  to  be  sufBciently  fast  compared  with  the 
regulator  dynamics. 


4  XPTOS  Performance;  An  Illustrative  Example 

In  this  section,  we  examine  the  performance  of  the  XPTOS  via  a  simulation  example.  As  a  representative  test  case, 
we  have  selected  a'flexible  structure  for  which  the  PTOS  algorithm  (designed  without  consideration  for  the  structural 
mode)  gave  unsatisfactory  results.  The  performance  of  the  XPTOS  is  compared  against  the  slew  maneuver.s  that  are 
obtaibned  by  using  a  sine-versine  opendoop  control  law  fl).  Furthermore,  we  examine  the  robustness  of  both  algorithms 
with  respect  to  parameter  variations. 

The  flexible  structure  to  be  controlled  has  the  state-space  representation  described  in  (17)-(18),  with 

Qf  —  Z  ,  ay=50,C  —  -OOh  ,  and  w„  =  26.8  rad-scc~^  . 

The  design  methodology  described  in  Section  3  gives  the  following  values  for  the  XPTOS  parameters; 

.8, 

(12.67  3.25  .74  .65]  , 

29.01  3.92  ■ 

12.50  9.18 

11.27  -S.52 

-58.53  101.15  _ 

The  structure  was  subject  to  rest-to-rest  maneuvers  with  amplitude  L  =  1.  In  order  to  compare  the  different  control 
strategies,  we  have  (arbitrarily)  defined  the  maneuver  time  tm  as  the  time  after  which  the  error  signal  r  -  y  becomes 
smaller  than  .018.  With  reference  to  Figures  4  through  8,  the  graphical  summaries  of  the  state  and  control  time 
histories  are  discussed. 


Case  J.  (Fig.  4)  is  a  rest-to-rest  maneuver  using  the  XPTOS  algorithm.  The  initial  phase  of  the  maneuver  aims  at 
rapidly  steering  the  rigid  body  stale  (i/r>t'r)  =  (0.0)  fo  the  ncigborhood  of  the  final  target  value  (1,0).  This 
results  in  considerable  excitation  of  the  flexible  inode.  However,  its  amplitude  is  rapidly  reduced  after  the 
initial  surge  in  the  input  control  profile,  la  fact  ,  the  maneuver  lime  equals  1.32  seconds.  Compare  with 
i„pt  =  2(1/3)^/^  =:  1.1547,  the  (theoretical)  iniiiiiiuim  time  required  to  steer  the  rigid  body  only,  in  the  absence 
of  the  flexible  mode  (cq.  (H))- 

Case  2.  (Fig.  5)  is  presented  as  evidence  of  the  “stabilizing”  effect  of  the  nonlinear  switching  function  /(■)  in  (8).  Let 
the  function  /  in  (21)  be  defined  by  /(e)  =  (Li/L2)c  for  all  e.  The  resulting  control  i,ystem  is  of  the  relay-  type, 
with  a  single  nonlinear  elerrient  at  the  input  of  the  fie.xible  structure.  The  response  is  highly  o.scilatory  throughout 
the  maneuver,  and  the  maneuver  time  increa.ses  to  1.95  seconds. 


Case  3.  (Fig.  6)  is  a  rest-to-rest  maneuver  using  the  (open-loop)  sine-versine  control  law  [1] 


==  o 


4\/3  / sin  2izi 
~9~  V  A  t 


/  2-,:  A 

I  i  —  COS - 

V  ^rj 


where  the  (theoretical)  rigid  body  slew  time  is 


Ar  ^ 


1.9046  seconds  . 


The  resulting  maneuver  cuise:;  less  excitation  of  tlie  structural  mode  (in  conipari.'.on  with  (jnse  1),  but  i;.  noliceably 
slower,  with  f,n  — 1-6  seconds. 
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Case  4.  (Fig.  7)  displays  the  robustness  of  the  XPTOS  with  respect  to  plant  parameter  variations.  We  notice  that  in 
both  cases  considered,  im  <  1-36  seconds. 

Case  5.  (Fig.  8)  depicts  the  inherent  lack  of  robustness  of  the  sinc-versine  control  algorithm.  Clearly,  a  practical  control 
system  based  on  this  algorithm  requires  switching  to  a  linear  regulator  at  the  end  of  the  sine-versine  rigid  body 
slew  time  At.  However,  this  will  further  increase  the  total  maneuver  tim-,. 

.  \  . 

5  Conclusions  and  Suggestions  for  Future  Res^^arch 

In  this  paper,  we  have  developed  an  Extended  Pro.ximate  Time-Optimal  Servo  (X.PTOS)  for  the  control  of  a  flexible 
structure  containing  a  single  structural  mode.  The  performance  of  the  closed-loop  control  algorithm  was  illustrated  with 
a  representative  example,  which  displays  the  e.xcellent  slewing  and  tracking  properties  of  the  XPTOS.  A  comparison 
was  made  with  typical  slew  maneuvers  obtained  by  applying  a  sine-versine  control  law. 

Future  work  will  requite  a  rigorous  analysis  of  the  stability  of  the  XPTOS,  following  the  methodology  proposed  in 

[8].  It  is  our  objective  to  refine  the  design  technique  e.xposed  in  section  3  in- order  to  accomodate  disturban:es  and 
sensor  noise,  and  provide  some  robustnesss  against  the  presence  of  neglected  higher-frequency  structural  mode.^. 
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Figure  6:  Case  3,  Performaucc  of  the  Siue-Versine  Control  Algorithm 


riy7.  (stcr!n.tj 


Figure  7;  Case  4.  llobustncss  of  tlie  XPIOS 

-  Wn  — .  30  rad'Sec'^  ;  a,.  =  3.o 

-  -  -  ♦  Wn  ^  24  rad-sec~^  ;  —  2.G 
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Figure  8:  Case  5,  Nonrobustness  of  the  Sine-Vetsinc  Control  Algorithm 

-  =  30  rad-scc“'  ;  Cr  -•  3.5 

....  Wn  =  24  rad-sec“^  ;  Or  —  2.6 
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Published  in  the  Proceedings  of  the  American  Control  Conference,  Pittsburgh,  Pennsylvania, 
June  1989. 
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Abstract  The  Extended  Proximate  Time-Optimal 
Servo  (XPTOS)  developed  in  [8]  is  analyzed  for  stability 
using  new  techniques,  ?nd  the  performance  of  it  com¬ 
pared  against  alternate  methods  such  as  sine-versine  for 
the  fast  slewing  of  flexible  structures.  The  XPTOS  sys¬ 
tem  operates  in  closed  loop,  and  blends  in  its  structure 
the  characteristics  of  a  time-optimal  control  law  and  the 
fine  tracking  properties  of  a  properly  tuned  linear  reg¬ 
ulator.  This  study  is  addressed  to  the  (idealized)  case 
of  flexible  structures  that  contain  a  single  or  dominant 
structural  mode.  Simulation  results  demonstrate  the  per¬ 
formance  of  the  XPTOS,  and  delineate  its  range  of  ap¬ 
plicability. 

1  Introduction 

The  advent  of  lightweight  flexible  structures  has  pre¬ 
sented  engineers  and  theorists  with  a  multitude  of  chal¬ 
lenging  tasks  in  the  area  of  control  systems  design  (e.g., 
[2]  I  (3],  [6],  [7]).  Required  arc  sophisticated  control  al¬ 
gorithms  that  make  optimal  use  of  the  maximum  torque 
available  for  rapid  slew,  and  achieve  high  positioning  or 
tracking  accuracy  after  the  position  or  tracking  error  has 
become  sufficiently  small. 

In  this  paper,  we  develop  a  practical  time-optimal  con¬ 
troller  for  rapid  slewing  of  flexible  structures;  in  particu¬ 
lar,  the  case  where  the  structure  model  contains  a  single 
structural  mode.  A  nonlinear  feedback  controller  is  ob¬ 
tained  that  blends  the  time-optimal  characteristics  of  a 
switching  control  law,  and  the  fine  tracking  properties  of 
a  properly  tuned  linear  regulator.  The  approach  pursued 
here  is  a  reiteration  and  extension  to  the  original  work 
by  Workman  [8],  [10],  [9]  on  a  proximate  time-optimal 
servomechanism  (PTOS),  and  on  an  extended  adaptive 
form  of  PTOS  (XAPTOS)  for  control  of  both  rigid  struc¬ 
tures  with  high  frequency  unmodeled  dynamics  that  are 
outside  of  the  performance  bandwidth,  as  well  as  flexible 
structures  with  resonances  close  to  but  beyond  the  band¬ 
width  of  the  linear  regulation  mode  of  operation.  The 
techniques  exposed  in  [10],  [9]  are  extended  by  taking 
directly  into  account  the  existence  of  a  lightly  damped 
structural  mode  as  weis  done  by  Workman  in  [10],  and 
in  addition  provide  a  technique  suggested  by  Franklin 
which  aids  in  extension  of  XAPTOS  (or  XPTOS  with¬ 
out  the  adaptive  feature)  to  the  case  of  a  flexible  mode 
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or  modes  which  may  possibly  be  within  the  performance 
bandwidth.  Numerical  simulations  show  that  the  XP¬ 
TOS  algorithm  is  robust  against  plant  parameter  varia¬ 
tions.  • 

The  paper  is  organized  as  follows.  Section  2  contains 
a  brief  description  of  the  proximate  time-optimal  ser¬ 
vomechanism  (PTOS)  presented  in  [8],  [9].  Section  3 
is  devoted  to  the  extended  proximate  time-optimal  ser¬ 
vomechanism  (XPTOS)  and  presents  some  new  theoret¬ 
ical  results  on  stability.  Section  4  illustrates  the  per¬ 
formance  of  the  XPTOS  via  a  parametric  study.  Sec¬ 
tion  5  describes  the  extended  adaptive  proximate  time- 
optimal  servomechanism  (XAPTOS)  which  combines  the 
3^TOS  with  a  weighted  recursive  least  squares  (WRLS) 
identification  of  the  plant  parameters  as  developed  in  [8]. 
Finally,  Section  6  contains  the  conclusions  and  sugges¬ 
tions  for  future  research. 

2  Time- Optimal  Control  of  a 
Rigid  Body 

2.1  The  Proximate  Time-Optimal  Ser¬ 
vomechanism:  A  Review 

Let  the  plant  P  be  described  by  the  system  of  equations 

V  =  arsat(u)  '■  ' 

where  u  and  y  denote  the  plant  input  and  output  re¬ 
spectively,  Ur  is  the  maximum  magnitude  of  (rigid  body) 
acceleration  available,  and  sat(-)  is  the  normalized  satu¬ 
ration  function,  i.e.  ,  sat(x)  =  i,  |i|  <  1  and  sat.(x)  = 
sgn(i),  [x]  >  1  where  5gn(  )  is  the  “sign"  function. 
Given  an  initial  state  (yo,wo)  at  time  <  =  0,  suppose 
it  is  required  to  reach  a  desired  state  (y,  v)  =  (j/deai  0)  in 
minimum  time.  The  well  known  time-optimal  controller 
is  given  by  [4] 

u  =  sgn(/t«(c) -n) 

/to«(e)  =  sgn(c)>/2a7H  (2) 

«  =  yd«  -  y 
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The  time-optimal  control  law  (2),  although  optimal,  is 
not  practical.  In  fact,  the  existence  of  proces-s  or  mea¬ 
surement  noise  will  make  the  control  signal  u  “chatter” 
between  its  maximum  and  minimum  values,  thus  excite 
ing  unmodeled  dynamics  that  are  always  present  in  a 
more  accurate  model  of  a  real  plant.  Even  in  the  ab¬ 
sence  of  exogenous  signals,  the  time-optimal  controller 
lacks  robustness  with  respect  to  neglected  dynamics,  as 
the  presence  of  (ignored)  lightly  damped  modes  may  in¬ 
duce  the  onset  of  limit  cycles  (Workman,  [8]).  To  cir¬ 
cumvent  these  difficulties,  a  proximate  time-optimal  ser¬ 
vomechanism  (PTOS)  was  introduced  by  Workman  [8] 
and  Workman,  Kosut,  and  Franklin  [9].  The  resulting 
control  system  is  diagrammed  in  Figure  1. 


Figure  1:  Proximate  Time-Optimal  Servomechanism 


The  equations  describing  the  PTOS  controller  are 


/pto»(£) 


'  {KJK.y,  |el<e^ 

sgn(e)  |  v''2a]c[- l//<2]  ,  !«!  >  cr 


(3) 

where  Ki,  Ki,  and  arc  positive  real  constants  and 
are  the  control  design  variables.  Observe  that  whenever 
lei  <  ec,  the  PTOS  control  is  linear,  that  is 


u  =  /{[C  —  A'tv,  V(c|  <  ef. 


Thus,  Ci  is  the  size  of  the  linear  region,  Ki,K2  arc  the 
linear  position  and  velocity  control  gains,  respectively, 
and  the  constant  a  is  a  design  acceleration. 

As  explained  in  reference  [8],  the  linear  and  nonlinear 
portions  of  /ptoj(‘)  in  (3)  can  be  connected  in  such  a  way 
as  to  preserve  the  continuity  of  /pto6(‘)  and  its  derivative. 
This  requires  the  satisfaction  of  the  following  constraints: 


a  ~ 


21<i 


('!) 


With  the  above  constraints,  we  have  the  following  stabil 
ity  result  from  [8],  [9]. 


Theorem  1  (Rest-to-llest  Stability)  Tim 
equtiibnum  slate  {y,v)  =  (j/dsaiO)  uf  Ike  system  (1)  with 
PTOS  controller  (3)  is  globally  asymptotically  stable  if 


0  <  a  <  Qp 


(5) 


The  ratio  a/ur  is  referred  to  in  [9]  as  the  acceleration 
discount  factor.  Thus,  during  deceleration  the  PTOS 
controller  uses  less  acceleration  than  is  actually  available. 
The  acceleration  a  will  be  referred  to  here  as  the  discount 
acceleration. 


2,2  Designing  the  Linear  Controller 

Let  H{$)  =  Y(s)/R(s)  denote  the  transfer  function  of 
the  closed-loop  system  of  figure  1,  when  operating  in  the 
linear  region.  With  the  usual  notation  for  a  second  order 
system,  and  using  (9),  it  follows  that 


JI(s)  = 


-t-  '2cJdWdS  -t- 

provided  that, 

li'l  =  A'o  =  2QuJdfar 

Using  the  constraints  (4), 


(6) 


(") 


a--ar/(2Ci)  e;  =  Ur  A 


(S) 


Clearly  then,  from  the  above  Theorem,  a  <  Or  if  O  > 
l/\/2,  and  hence,  rest-to-rest  stability  of  the  PTOS  sys¬ 
tem  is  insured. 

Thus,  the  overall  closed-loop  system  bandwidth  '-oj  and 
damping  factor  0  can  be  independently  controlled  by  the 
parameters  Ki  and  a,  respectively.  As  shown  in  (8]-[9], 
the  PTOS  algorithm  can  be  applied  to  the  control  of  a 
purely  rigid  body  with  high  frequency  dynamics  that  are 
outside  of  the  performance  bandwidth  coj.  However,  if 
the  structural  modes  are  within  the  desired  closed  loop 
system  bandwidth,  then  they  must  be  treated  explicitly 
in  the  initial  design.  This  is  the  subject  of  Section  3. 


3  Time- Optimal  Control  of  a 
Flexible  Structure 

The  flexible  structure  to  be  controlled  is  described  by 


Xr  “ 

tv 

Vr  - 

a,  sal(u) 

j'/  - 

—2Qu>jVf  —  uijxj  T  Uj  sat(u) 

y  ~ 

Ir  +  1-7 

The  transfer  function  from  sal(«)  to  the  output  y  is  then 
P(s)  =  Pr{s)  +  Pj{s)  where 


Pr{s)  r. 


Pj{s) 


S'  -f-  -f  Wy 


(10) 


'IP 


(j-,,!!,)  denote  the  rigid-body  mode  and 
(xj,vj)  denotes  the  lie  able  mode  at  frequency  uj  with 
modal  damping  (^j.  Without  any  rate  damping  mecha¬ 
nism,  Qj  <<  1,  a  typical  range  being  G  [.001,. 005]. 
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3.1  The  structure  of  the  XPTOS 

The  rationale  behind  the  XPTOS  can  be  explained  afi 
follows.  To  steer  the  structure  from  an  initial  rest  posi¬ 
tion  to  a  target  rest  position,  the  PTOS  control  strategy 
is  first  applied  to  control  the  rigid  body  motion,  and  later 
smoothly  blended  into  a  linear  state-feedback  control  law 
th;-,!,  actively  damps  out  the  structural  mode  vibration  in¬ 
duced  by  the  PTOS  control  law.  Although  the  idea  for 
the  XPTOS  sintciurc  was  already  presented  in  [8],  here 
we  define  an  approach  to  the  problem  of  flexible  modes 
within  the  bandwidth  of  the  controller  by  separating  the 
rigid  body  motion  and  the  fle..xible  motton. 


Figure  2'  The  Extended  Proximate  Time  Optimal  Ser- 
vornechauism  (XPTOS) 

The  resulting  control  scheme,  depicted  in  Figure  2,  is 
described  as  follows; 

V  -  K2(/ptoi.(^r)  -  t'r]  -  ti/ 

iij  =  +  K^VJ 

The  quantities  Xr,Vr,xj,v/  are  estimates  (or  measure¬ 
ments)  of  Xri  Vr,xj,vj.  respectively,  and  Cf  is  an  estimate 
of  the  rigid-body  position  error  Cr  =  —  ^r-  Observe 

that  in  the  PTOS  coutroller  (3)  the  error  used  there  is 
the  output  error  e  =  yde,  —  y. 

Suppose  that  y,u  are  the  available  mea;,uremcnts. 
Then  a  state  observer  system  is  of  the  form: 

=  Vr  +  Li{y~y) 

V,  =  arsat(u) -b  Li(2/ -  i/) 

xj  -  Vf  +  Lz(y-y) 

vy  —  — 2i^^a)y{iy  —  tvyXy -b  ay  sat(ti) -4- L<(y  —  y) 

y  -  ir  +  Xj 

(12) 

The  observer  gains  Li,...,La  arc  chosen  by  the  usual 
means  so  that  (xp,  I'v,  zy ,  i)y)  (xr,  Ur,  3:y,  v/)  as  i  —>■ 
CO,  exponciioin liy  fast.  Recall  that  to  select  the  observer 
gains  in  this  way  requires  precise  kiiowh.'  uj  of  the  plant 
parameters  ar,wy,<(y,  and  uy. 


feedback.  Due  to  limited  space  the  proof  is  omitted  and 
is  available  upon  request.  The  control  is 

fir  “  Vden  0'^) 

«y  ~  KzXf+K^Vf 


Before  stating  the  stability  result,  we  define  the  transfer 
functioiui  Cr'(^')  and  F{s)  by 

_ ^rjs) _ 

r(s)  _  -b-  (S)  _  ^  ^  {K^s)Pr{s)  -b  {Kz  -b  K^s)pj{s) 

..(H) 

With  P-(«)  and  Pj{s)  from  (10), 


F(s)  = 


ar(.5^  -b  2C/tU/S-bU’y) 

-b  y3s^  -b  Ts  -b  KzOrUy 


(15) 


where  P  =  ifaUr -b  +2C/^j  and  T  =  Wy.-b  Kz^j  + 
2<ywyJv2“r-  Observe  that  G(s)  b  the  transfer  function 
from  Kzfptoti^)  to  Xr  when  the  control  is  unsaturated, 
i.e.  ,  lu[<  1. 

Theorem  2  (XPTOS:  Ibest-to-Ibest  Stability) 
Rtferring  io  ihe.  sysitm  of  Fignr:  £  as  described  by  (9) 
and  (11),  suppose  that: 

(i)  F{s)  in  (15)  is  stable. 

(ii)  Therr  exists  a  constant  a  >  0  cuch  that  for  G{s)  in 

(U). 

Re[(l -b  iw9)G(yu>)] -b  1/A'i  >0,  Vw  G  HI  (16) 


(iii)  The  discount  acceler 


( •)  satisfies 


0  <  a  < 


a  r 


(17) 


where 

gj{i)  ----  r-W(/f3-bA4-’)^/(*0} 

hjit)  X.  £-i{(-w}/0+  (18) 

{Kz-2QujjK^)]Pj{s)} 

and  where  |1  •  ]|i  deucies  the  Li-norm  l|/||i  = 

ST\nt)\dt. 

Under  thes'  'udilw.  system  is  rest-io-rcsi  sta¬ 

ble,  that  i.s,  .j  'tir  sys .  is  initialln  at  rest,  then  for 
all  e  Ill,  tl>t  i^ate  (x^,  fr,  xy ,  ny)  asymptotically  ap¬ 
proaches  (i/dM,  0, 0, 0). 


Graphical  Frequency  Te.st 


3.2  Stability  of  XPTOS:  Pull  State  Feed¬ 
back 

We  now  present  a  new  stability  result  for  re;d.-to-re.st  ma¬ 
neuvers  of  (9)  using  the  XPd  control  with  full  State- 


Condition  (ii)  has  the  following  well  known  geonietnr. 
interpretation,  .</.  ,  [1].  Plot  tvlm[G(jn.’)]  vs.  Re[G(jw)] 
as  w  varie.3  in  [0,cx3).  This  plot  b  known  as  the  Popov 
plot  of  G(s).  Condition  (ii)  states  that  the  Popov  plot 
must  lie  to  the  right  of  a  straight  line  of  frositivc  .slope  1/? 
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paussing  through  the  point  — l/Ki  +  JO.  Thus,  condition 
(i,i)  can  be  checked  graphically, 

A.s  an  e.'cample,  we  show  that  Theorem  I  (control  of  a 
pure  rigid  body)  is  a  special  case  of  Theorem  2  with 

a/  =  K3  =  ~  0 

Thus,  condition  (iii)  becomes 

0  <  a  <  Or 

as  stated  in  (5).  To  satisfy  (i)  and  (ii),  we  have 


In  this  case  F{s)  from  (1.5)  is  stable  because  Cr  and  /vA 
are  both  positive.  Moreover, 


Re[(S’(jw)] 

wIm[G(iw)] 

Thus, 

t.jlm[<j(_7ui)]  —  a,./'f2Re[G(jui)] 

This  means  that  the  Popov  plot  is  a  straight  line  -  arougli 
the  origin  with  a  positive  slope  and  i.s  always  to  the  right 
of  the  point  —l/ZCi  4-  jO  for  any  value  of  A'l  >  U,  and  .so 
(iii)  nolds.  This  proves  the  PTOS  Theorem. 

4  Parametric  Study  of  XPTOS 

7'he  study  described  in  this  section  was  motivated  by  the 
following  question:  for  what  values  of  the  plant  parame¬ 
ters  ar,af,u>f,Q  i.s  the  XPTOS  effective  in  rapidly  slew¬ 
ing  the  structure  from  an  initial  to  a  fuial  rest  po.sitioii? 
la  particular,  (i)  how  far  is  the  XPTOS  performance  from 
being  time-optimai,  and  (d)  what  improvement  does  it 
show  when  compared  with  the  slewing  maneuvers  that 
are  obtained  by  simply  using  the  PTOS,  with  complete 
disregard  for  the  struct!  lal  inode?  In  this  section  we  a-y- 
sess  the  domain  of  appl.cability  of  t'lc  XPTOS  by  fixing 
the  rigid  body  parameter  cir,  and  letting  aj  and  ui/  vary 
over  a  certain  range.  For  simplicity,  we  consi  ler  the  c  i.se 
where  Q  ~  Q. 

4,1  Time-Opti;  .lal  Slewing 

Time-optima!  rest-to  iest  tnane'.'ver.s  for  flexible  struc¬ 
tures  with  control  Input  saturation  have  been  charactf-r. 
ized  in  the  work  of  Ijen-A.sher,  Puins,  and  Cliff  [JG]  and 
Singh,  Kabamba,  and  Mr,()'laniroch  [17],  For  i,he  prob¬ 
lem  at  haiii),  tlie  lirne-of  nmniii  re:  t- to-i irSt  control 
exists  and  is  eniqim,  i.s  bang-ban, g,  and  <:xiiibit.--i  anti.syin- 
metne  characieri.slict,  i.r.  , 

u'{ij  -  t)  —  --n'fty.  4-  rj,  0  <  7  £ 


O-T 


-4-  apA'I 
a?  AG 


where  ij  is  the  optimum  slewing  time  (see  [17].  Fur¬ 
thermore,  u*(0  can  be  computed  using  the  methods  de¬ 
scribed  in  [17]  and  [16],  For  the  flexible  structure  (9), 
(10)  with  Or  =  30  and  </  =  0,  Figure  3  is  a  plot  of 
the  optimal  slewing  time  as  a  function  of  the  structural 
mode  natural  frequency  wj,  as  the  .structure  is  subject 
to  rest-to-rest  maneuvers  of  amplitude  jydes!  =  1-  This 
represents  a  fundamental  limitation  to  the  performance 
that  is  achievable  with  bounded  input  control,  and  as 
such,  it  is  a  benchmark  against  which  the  performance 
of  other  algorithms  can  be  compared.  An  interesting  fact 
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Figure  3:  Time-Optimal  Control  of  a  Flexible  Structure 
(Performance  Characteri.Aics) 

is  that  the  optimal  control  strategy  is  independent  of  the 
ratio  ajfar,  and  the  opt'mal  slewing  time  is  very  close 
to  the  rigid  body  optimal  maneuver  time  for  suflicieritiy 
large  values  of  w/ .  As  tlie  natural  frequency  of  the  struc¬ 
tural  mode  decreases,  the  optima!  slewing  lime  becomes 
increasingly  larger  (sec  [16]  for  a  thorough  discurssion  con- 
cer.ring  the  asymptotic  behavior  of  the  time-optimal  so¬ 
lution). 

4.2  Performance  of  PTOS  and  XPTOS 

''  lie  I’l  Ob  and  XPTOS  control  algc-rithmi;  v.'cre  tested 
by  suijjecling  the  flexible  structure  (9),  (10)  to  rest-to- 
rest  maneuvers  wilii  amplitude  ji/dril  —  F  ^  he  parmn- 
clers  Ur  and  C/  "'•'tc  fixed  at  3.0  and  0.0  re:;];ectively, 
while  dj/dr  and  uij  were  varied  over  the  discrete  .sets 

{.25,.5,-'.0,2.0,;!.OJ  , 

and 

= -'8.9, 10,  r2,  14,  10, 18,20,  ;F),d0,.50)  , 

rrepecli ve!y.  A.ssuiTiing  that  brjth  Hie  oul,)in  position 
and  vrdocity  aiui  available  ior  measurement,  tie.  P'I’GS 
wa.s  designed  taking  into  a,.couiit  the  rigid  '1;  '  ■  motion, 
oniy  This  led  to  thf;  de.-sign  liaraine'er.:  A'<  =:  20  and 
a/fir  —  .0,  ii  respective  of  llie  valin-s  a.‘e>nine(i  by  tlie  pa- 
tairieler:i  in  the  set;,  and  .SC,  lignie  4  i'  a  plot 
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of  sk-wing  timc;  /eraus  a//ar  and  wijere  the  slewing 
time  here  is  defined  as  the  time  after  which  the  tiaeVing 
error  —  y  becomes  smaller  than  .01. 


5.1  XAPTOS  for  »  Flexible  Structure 

The  main  idea  behind  XAPTOS  is  to  combine  a  gen¬ 
eral  self  tuning  observer  controller  (GSTOC)  [8]  with  the 
XPTOS  controller.  A.  weighted  recursive  least  squares 
CVVRLS)  algorithm  is  used  to  identify  the  unknown  plant 
parameters  and  a  state  space  model  of  the  plant  is 
formed.  The  estim..ior  and  control  gains  are  obtained 
via  pole  placement  and  uiM  in  the  XPTOS  controller. 

A  development  of  XAPTOS  for  the  flexible  structure 
(9),  (10)  is  now  described.  The  overall  dynamic  re¬ 
lationship  between  the  input  and  output  of  the  struc¬ 
ture  is  described  by  the  discrete-time  transfer  function 
=  Pr{z~^)  -f  where 

Pr{z-^) 

and  T  is  the  sampling  interval.  Assuming  that  the  rigid 
body  parameter  Or  is  known,  the  problem  is  to  deter¬ 
mine  the  flexible  mode,  parameters  aj  and  cjj  from  the 
measurement  vector  whi.,h  forms  the  regressor 

=  l>K^  - 1)  - 1)1 

where. 


Figure  4:  Performance  of  PTOS  and  XPTOS  (Slewing 
Time  Versus  Frequency  of  Structural  Mode) 

For  large  values  of  uif  and/or  small  values  of  oj  /or, 
the  practical  slewing  time  approaches  the  time-op’  imal 
rigid  body  maneuver  time.  As  the  frequency  u)/  de¬ 
creases  or  oy/or  increases,  the  PTOS  maneuver  time  in¬ 
creases  rapidly  in  comparison  with  the  theoretical  opti¬ 
mum  value.  The  XPTOS  maneuver  time  exliibits  the 
same  trend  in  behavior,  but  the  performance  deterio¬ 
ration  occurs  at  a  slower  rate.  In  fact,  over  the  range 
of  frequencies  [8,  20]  rad  sc-c"  1  its  performance  shows  a 
drastic  improvement  over  the  ptrfonirducc  o';  (Mj.jucd  .villi 
the  XPTOS. 


v{k)  -  yi^)  -  2y(i.-  -  1)  -f  vik  -  2)- 

co(u(fc  —  1)  -f-  u(ifc  —  2) 

;i(jk)  =  u{k)~u{k-l)-u{k-2)  +  u{k-Z) 

■j3 

Co  =  Or  -V 

cj  =  'icosoij-T 

Then  the  input-output  behavior  is  described  by 

Yj{k)=:0'^(k)<P{k),  k  =  0,1,2,... 

where  0{k)  ■—  (cj  and  Y/{k)  —  Ti{k)  -f-  r/(jb  —  2)  is  a 
function  of  y(j)  and  u(/),  j  =  k~A, . .  .,k.  The  parameter 
vector  0{k)  can  then  be  determined  using  WRLS: 


oik,-i) Hkh-\ _ 

^  ^  +4>^{k)P{k-]. 


)m] 


{y{k)~Yj{k)} 


5  Adaptive  Control 

If  tb./  flexible  p.;.i'a,nieter.s  w/  and  are  not  known  or  arc 
slowly  changing,  the  pcforii!  Mice  of  the  flexible  structure 
may  be  improved  with  an  adajit've  XPTOS  algorithm 
called  XAPTOfl.  lu  thi.s  section  the  XAPTOP  strategy 
(see  ]8],  f2l,  (I'Jj)  is  described  and  applie.;  o  the  flexible 
stnicvaie  (9),  (10).  The  di.screte-tiine  system  i.s  .norc 
efficient  tf)  impi’inent  than  tlie  continuous-time  .syslcin 
and  hence  a  discrete-time  version  of  the  XPTOS  control 
(11)  is  conilsined  with  a  parameter  identifier  and  a  state 
estimat!';  ii.)  lurai  XAP'J’O.S. 


-  i  PH-  -  O  l\ -  l)m<i>^{k)P{k  -  1)  ■ 

7  ’’  7  I  Y  4'^\k)P{k  ~  l)4>{k)  , 

vdiere  0  <  7  <  1  and  k  1  —  7. 

With  the  parameter  estimates  n j  and  wy ,  a  state-space 
model  of  the  plant  i.s  formed  which  partiuions  the  known 
double  integrator  plant  from  the  idcntificxl  flexible  mode 
and  lias  the  state  vector; 

■  X^(k)  [jTp  Vr  Xf  Vf] 

A  state  vetcor  of  this  form  is  desirable  since  the  position 
and  velocity  estiniate.s  are  needed  in  the  XPTO.S  control 
alg.'irithm. 


With  the  state  space  modeh  a-  state  estimator  and  a  and  tracking  properties  of  the  XPTOS,  as  well  as  its  to- 
contioller  are  constructed  for  the  plant.  The  desired  esti-  Tusinoss  against,  plant  parameter  variation.  A  detailed 


mator  and  controller  characteristic  polynomials,  a*  and 
Oe,  are  formed  by  the  knov?n  rigid  body  mode  pole  loca¬ 
tions  and  a  radial  pole  projection  of  the  identified  f.c.xibte 
mode  pole  locations.  The  estimator  and  control  gains  are 
determined  via  pole  placement.  A  closed  form  e.xpression 
of  the  estimator  gains,  L,  and  the  control  gains,  K,  are 
determined  and  used  on-line  in  the  XAPTOS  algorithm; 

L  = 

K=r.  e^[r  4>r  'i>=r 
5.2  Simulation  Results 

The  XAPTOS  algorithm  as  described  above  but  with 
full-state  feedback  was  implemented  in  simulations  on 
the  flexible  structure  (9),  (10).  Figure  5  presents  the 
results  of  the  identification  ofay  and  uy.  Future  results 
will  include  the  ca.se  of  XJkPTOS  with  both  the  identifier 
and  the  state  estimator. 


Figure  5;  Identification  of  Plant  Parameters  ay  (top)  and 
uiy  (bottom) 


6  Concluding  Remarks 

In  this  jjaper,  we  have  dcv<doped  an  bxtimded  Ptuxiiiiate 
Time-Optimal  Servo  (XP'rOS)  for  rapid  slewing  of  llexi- 
ble  structure  models  containing  a  .single  structural  mode. 
Some  preliminary  resiill.s  were  olitained  concerning  the 
stability  of  the  proposed  clo.sed-loop  control  .scheme.  'File 
performance  of  the  control  algorithm  wa.s  illustrated  with 
a  representative  e.xample,  which  display;;  lh<;  line  ;;h.wiiig 


parametic  .study  was  conducted  to  assess  the  domain  of 
applicability  of  the  XPTOS.  The  XAPTOS  algorithm 
was  described  and  shown  to  adaptively  control  the  fle.xi- 
ble  structure. 

Future  work  requires  a  complete  analysis  of  the  stabil¬ 
ity  of  the  XPTOS.  It  is  also  our  objective  to  refine  the 
design  technique  exposed  in  section  3  to  accommodate 
disturbances  and  sensor  noise,  and  provide  robu.stnesr. 
against  the  presence  of  neglected  high  frequency  struc¬ 
tural  modes. 
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Abstract 

Aa  approximation  to  the  tirnc-optimal  tracking  problem  is  ex¬ 
pressed  in  terms  of  a  sequence  of  linear  programs  which  are  set  up 
using  a  transfer  function  approach.  The  results  are  applicable  to  any 
linear  time-invariant  finite-dimcasional  multi-input  multi-output  plant. 
An  example  illustrates  tlie  feasibility  of  the  approach. 

1.  Introduction 

Poatryagin’s  Maximum  Principle  (sec  c.g.,  (Ij)  brings  a  complete 
solution  to  the  pioblcm  associated  with  the  minimum-time  tracking 
-(of  a  fixed  or  moving  target)  subject  to  actuator  saturation.  In  order 
for  such  an  approach  to  be  implcmcntable,  one  requires  a  characteri¬ 
zation  of  the  switching  surfaces  in  the  state-space.  Complete  solutions 
for  single-input  single-output  low  order  models  have  been  derived  in 
-the  literature.  The  bang-bang  nature  of  the  solution  requires  a  relay 
in  the  implementation;  hence,  chattering  subject  to  disturbances.  One 
way  to  overcome  this  difficulty  is  to  approximate  the  infinite-gain  non¬ 
linearity  (namely,  the  relay)  with  a  finite-giin  nonlinearity.  Hence  one 
is  bound  to  step  back  from  the  time-optimal  result  for  the  sake  of  im¬ 
plementation.  Even  if  one  reformulates  the  open-loop  minimum-time 
problem  as  a  fast  finite-time  tracking  problem  (for  a  given  reference 
trajectory,  determine  an  input  (subject  to  actuator  saturation)  so  that 
the  tracking  error  remains  at  zero  after  a  finite  time-instant)  a  feasi¬ 
ble  solution  method  for  a  closed-loop  design  gc' remains  a  challenge. 
The  demand  for  fast  tracking  with  saturating  actuators  has  produced 
a  variety  of  closed-loop  implementations  ranging  from  using  idealized 
relays,  finite-gain  relays,  adaptation  methods,  etc.  (6,V,8]  .  In  the 
meantime,  openloop  solution  to  the  problem  with  flexible  inodes  has 
been  studied  in  detail,  for  siicdfic  cases  with  no  damping.  The  result¬ 
ing  proposed  nonlinear  optimization  problems  arc  derived  using  the 
necessary  optimality  conditions  posed  by  the  Maximum  Principle,  for 
spccifir  plant  models  [2,5]  . 

In  this  paper,  an  approximation  to  tlio  time-optimal  tracking  prob¬ 
lem  is  proposed  for  linear  time-invariant  finite-dimensional  multi-input 
multi-output  plants.  I'lic  approximation  relics  on  a  transfer  func¬ 
tion  approach  to  formulate  constraints  on  the  set  of  admis.sihlc  finite 
duration  sigiiaJs  lliat  aclileve  precise  point-to-point  positioning  of  flex¬ 
ible  structures  [3]  .  The  rc.sults  repoited  in  this  pajier  have  already 
been  used  to  design  fast  finito-tirno  tracking  closed-loop  feedback  sys 
terns.  Due  to  sjiacc  IJmitalions,  a  portion  of  the  opcn-loofi  results  are 
reported.  An  cxanijile  iilustrale.s  the  fca.sibility  of  the  aiiproacli. 

2,  I’t'climliirtrie.s 

Let  the  plant,  H  ,  be  .strictly  jiroper  and  have  a  minim.d  Btate-si»ace 
description  ,  witli  n,  input.s,  n„  outputs  and  n,  stales.  'Ihe 

state-space  description  can  bo  in  continuous-time  or  diecretc-liinc.  '1  he 
results  will  be  exclusively  stated  for  the  conlinuou.s-tinic  strictly  proper 
multi-input  multi-output  jilarit  .  Since  tlic  approach  relics  on  a 
rational  transfer  function  de.scrijilicjn,  with  approjniatc  modificalions, 
discrctc-tinic  set  ting  can  also  be  handled. 

Definition  (Uj)  :  For  a  given  7'  G  (0,«j)  ,  U-p  denotes  the  set  of  all 
bounded  input.';  of  duration  7'  ,  where 

ZVr  (  u  ;  11(4  -•  Ill"'  I  u(f)  --  0  for  <  >  7’  ,  llujh,,  <  00)  .  Ij 
Definition  (7’-trBck)  :  For  a  given  T  G  (0,oo)  ,  y  i«  said  to  7'-ttack 
r  ill  y(l)  =  r{()  for  all  f  >  ']'  .  fl 

Fact  1  :  For  a  given  plant  J’  willi  a  minimal  descrijitioii  {A,H,C)  , 
and  a  reference  r  ,  there  exist;;  an  injiut  u  G  lir  such  llial  (/•’  *  ir) 
T-tracks  r  if  and  onlv  if  r  (  Hj  ,  where 

71r;=  {r;lll4  lit"”  |  r{()  =  C'c-'''xo,  Zo  G  hi"' ,  <  >  7')  .13 

Iroiu  lact  )  ,  we  conclude  that  the  Laplace  transform  of  a  signal  in 
lip  can  only  have  pole.s  at  llie  Jilanl  jioles. 

Fn.t  2:  For  a  given  plant  J'  witli  a  minimal  de.scrijition  (A,  71  6')  ,  let 

'Itcfcairclj  tuppotlcd  hy  AI-U.‘'.U  uiidc-r  coiitiacl  1  tOC'rO  88  C'-CU12 


r  G  Tip  ■  Then  there  exists  a  unique  rp  G  Up  and  a  unique  G  111"' 
such  that  r(<)  =  rp(t)  -1  Ce'^'zr  • 

3.  Transfer  Function  Approach 
Let  r  G  Tip  and  the  plant  be  at  an  initial  state  lo  at  f  =  0  .  Let 
Tp  and  Xr  describe  the  unique  decomposition  of  r  as  in  Fact  2  .  Tlie 
goal  is  to  find  a  u  e  Up  such  that 

Rp{i)  +  C(s/  -  A)-‘ir  =  C(j/  -  A)-'  (in  H  BU(_s))  , 
Without  loss  of  generality,  we  assume  that  lo  is  zero.  If  iq  is  not 
zero,  redefine  ir  as  (i^  —  lo)  • 

Proposition  1  :  Let  7'  have  the  expansion  P(a)  =  < 

where  G  .  Let  r  C  Ttp  .  Consider  the  unique  decompo¬ 

sition  r  =;  Tp  -t-  Trxy,  ,  where  rp  €  Up  and  r„p  =  Cc'^'ir  . 
Under  these  assumptions:  1.  r„p  G  Tip  ;  moreover,  71„p(s)  = 

53^1  (7^)7  '  '*'bcrc  Kij  G  fil"*  .  2.  The  set  of  all  signals 

u  €  that  {P  *  ti)  T-tracks  r  is  given  by 

{ucUr  1  -  i>.)'"'P(s)f7(.,)]]^^^^ 

i  =  1,...,L;  j  =  l,...,rn,- .□ 

A  Convex  Approximation  to  the  Minimum-Time  Tracking 
Problem  :  Choose  a  basis  of  Up  ;  for  a  Epccificd  N  ,  truncate  it 
to  a  finite  collection  .  Consider  tlie  subclass  of  signals  in  Up 

whose  Laplace  transforiiis  arc  of  the  form  I/(s)  =  yZi=iPiBi{s)  . 

Clearly,  all  of  the  maldiing  constraints  in  Proposition  1  translate  into 
a  linear  equation  in  terms  of  p  G  ,  say,  r(T)p  =  7  .  The  entries 
of  7  arc  solely  determined  by  the  entries  of  7f,y  corresponding  to  r„p 
(sec  Proposition  1  )  .  Wo  will  refer  to  the  space  7  is  in  as  the.  residue- 
space  (an  abuse  of  notation;  after  all,  not  all  entries  of  7  correspond 
to  the  residues  in  the  partial  fraction  expansion)  .  By  introducing  the 
actuation  saturation  constraints,  we  end  up  with  a  convex  feasibility 
ptoblcin: 

Find  pC  111^  such  tliat  F(7‘)p  ==  7  and  ||p^/i||oo  <  1  , 

where  /i{l)  (/ii(l)  ...  hn{t)\^  .  For  a  specified  T  and  N  ,  tlie 

inaximum  perforinance  along  the  desired  direction  7  is  determined  by 
ni.ax  ^  ^  A  G  II  ^  ^  sequence 

F(7')p’=-  A7 
<  1 

of  convex  ininiinjzalifjjj  problems  by  varying  7’  (and  A^)  lo  eweep  a 
luaxiinuin  p<irforinaiico  cuivo;  houcc,  oblaiiiiiig  an  api>roxjination  lo 
t)i4;  iniiiiiiiuin-time  problejn. 

A  IViliwobii  Cliojtr  fur  U{^)  ;  A  liiiio-o])liniai  input  signeJ  it  iieccs- 
warily  banj'-baiig.  WJiilc  Ihoro  is  no  upper-bound  on  the  finite  number 
uf  i.v.iirhin^'j.  in  peunaj,  for  a  fixed  number  of  BwilchiiigH,  say  N  ,  oiif 
can  clioose  a  te<juence  of  N  pulses  (with  aJternating  amplitudes)  v.'iih 
varying  widths  a.*,  a  family  of  functions  in  U-j'  .  One  can  then  solve 
for  l)ie  coiislraintr.  in  Proposition  1  lo  obtain  a  family  of  nonlinear 
(non  convex)  al/’/tbraic  equations  [d]  .  Similar  algebraic  equations  arf‘ 
oblained  by  applying  t)ie  Poiitryagin’s  Maximum  Princi]>le  to  Bpccific 
cases  of  J\  u.sin/'  ntate-bpace  coinpulatioiis  [2,5]  . 

For  a  fixed  7’  ,  consider  the  wciglitcd  sum  of  a  soquenre  of  A  pulses 
with  Viiiiforiii  widllis  {'I'/N)  ;  i.e.,  f/{s)  --  i  • 

Since  the  input  signal  is  pieccwific  constant,  the  actuation  bounds  can 
be  rxpf<'.sv>d  as  —  1  <  ;;  <  1  ,  where  p  0  Dl^  is  lo  be  deierniined. 

An  AjjproxiiTintlon  lo  llio  Minlimirii-Tinie  Problem  by  a  Se¬ 
quence  of  Linear  Progrnms;  lor  a  given  jda.ut  P  and  N  ,  the 
inaxiinuia  pel  fo] mance  function  ’i'  :  (f),oo)yll{  ”  — »  1I1.4  is  denned 

in  terms  of  tlie  linear  jircgrani  4'(7',  7)  max  ^  A  C  Hi  ’ 

y{'np~-  A7 
-1  <  p  <  1 

v/lieie  I  (7 )  is  obliiincd  by  Llur  inatcliing  condition:,  in  I'Toponition  1  . 
Any  pr  Il(^  hn  v.'hicli  the  linear  piogiain  returns  the  vnJue  *1^(7’, 7)  , 
is  used  to  define  the  lelalio/i  <1*  :  (y.o'j)  x  lip  ,  where 
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4(ri7)  P  '  ^  specified  tiine-instant  T*  i  7)  delcrniincs 

tte  maxim'iiii-pcrforiiiance  oac  can  achieve  along  the  specified  direc¬ 
tion  7  .  For  a  fixed  direction  7  ,  hy  sweeping  over  T  ,  one  obtains 
the  7-niaximum-pcrforniance  curve  (T ,  tfT,  7))  ,  -F  G  1^4.  ,  In¬ 

stead  of  generating  the  7-maximum-pcrformance  curve  for  T  e  JR.^.  , 
introduce  a  time-resolution  of  AT  and  discretize  the  curve;  i.o.,  for  a 
pre-determined  fcmai  I  ®val'tate  (T,^(r,7))  ,  T  =  A:AT  , 

0  S  ^  kmoT  •  ^ 

The  sub-optimal  (due  to  the  approximations)  time-instant  7  for 
which  the  residue  7  is  achieved  is  in  the  interval  T  G  (T/,r/  +  AT]  , 
where  «'(ri,7)  <  1  ,  and  9(T,  -f  Ar,7)  >  1  •  The  T-polytope  St, 

St  ■.=  {  X7  1  7  e  R"  ,  IhlU  =  !,>■£  Io,«'(T,7)!  } 
is  the  set  of  all  points  in  the  residue-space  that  can  be  reached  in  T 
seconds.  The  boundary  of  this  set,  denoted  b^;  OSr  ,  i*  referred  to 
as  the  T-isochrone.  Consider  the  T-polytope  St  which  is  a  subset  of 

St  ;St  :=  {  E?=i  I  \WU  <  U  ,  {=1 .  ■  •  -  -  <n} 

is  a  basis  in  hi"  . 

A  Map  from  the  Residue-Space  to  the  Input  Space  :  Our  goal 
is  to  construct  a  map,  possibly  using  look-up  tables,  such  that  given 
a  desired  reference  signal  and  the  states  of  the  plant,  an  input  signal 
is  generated  so  that  the  plant  output  T-tracks  the  reference  signal  as 
fast  as  possible.  The  desired  reference  signal  (which  should  be  in  V-t) 
and  the  initial  state  determines  the  amount  of  cliange  necessary  in  the 
residues  of  the  output  signal.  For  tliis  reason,  we  wiU  focus  on  the 
following  subproblem;  For  a  given  7  in  the  residue-space  determiuc 
an  input  signal  such  that  the  output  of  the  plant  achieves  tlic  residues 
specified  by  7  as  fast  as  possible.  From  now  on,  we  will  use  the  standard 
oithonormal  basis  (ci ,  . . .  ,  e„}^in  El"  .  In  order  to  cut  down  on  the 
Storage  space,  we  will  identify  St  with  its  **positive  orthant": 

St  :=  {  E?=i  A.'^(T,cOci  1  11  All,  <  1  ,  A  >  0  )  . 

Let  the  signum  function  SGf^  :  El"  — •  El"  be  defined  as  the 
entry  by  entry  signum  function  over  El  .  Let  the  operator  denote 
element-by-eleraent  product  in  El"  .  Suppose  that  7_€  dSr  •  Let  7 
be  defined  by  7  =:  SGN(^)  .  ♦  7  •  Clearly,  7  €  dSr  if  and  only  if 
7  €  dSr  •  Since  7  can  be  expressed  as  a  convex  combination 
7  =:  E"=i  Ai4'(7',c.)c,-  ,  A  2  0  ,  HAH,  =  1  , 
we  conclude  that  the  input  that  achieves  7  residue  in  T  seconds  is 
represented  by  p  =  E?=i  •  The  input  that  achieves  7 

residue  in  T  seconds  is  represented  by 
P  =  E"=i  {eTSGN(‘f))  HT,ei)  .  Nolo  that  -1  <  p  <  1  . 

We  now  outline  the  procedure: 

1.  Fix  TV  ,  A'T  and  A'mor  • 

2.  Solve  (n  •  fcmor)  linear  i-rogranis;  i.e.,  for  the  k^cz  time  points 
solve  for  and  store  ♦(/',£,)  ,  4’(T, c;)  ,  i  -  l,...,ti  . 

3.  For  a  given  7  determine  the  smallest  T  such  that  7  G  St  ■  We 
propose  the  following  procedure;  Let  ,]  €  Et*"’”'  be  defined  as 

1/  (e"=i  ^ZzT))  ’  ■  ■  •  -  • 

Let  the  fth  entry  of  7  be  the  first  entry  which  is  less  than  or  equal 
to  one.  Since  7  was  in  <S(t„„/\T)  start  witli,  such  an  I  exists. 
Hence,  we  conclude  that,  among  the  family  ol  d -isochrones,  7 
can  be  achieved  no  faster  than  'ft  seconds. 

4.  Once  the  index  t  and  tlie  muUiplior  t J r;  is  determined,  extract 
the  input  sequence  from  G  hl^  ; 

p  :=  e;^.  ■ 

Example 

Consider  the  single-inpul  single-output  plant 

F(s)  =  75+  Ei'=l  ’ 

v.'hCtC  Id,  cij  03]  “  1'^*  1  tt  2tll  .  (lui  109  to-,1  4k  [O.o  1  1.5]  rad/s  and 
ICi  <3!  =  ^-01  0.001]  .  Ch-arly,  the  admissible  r^^’s  are  linear 

combination  of  steps,  ramps  and  lightly-damiied  sinusoids.  Ihc  vast 
majority  of  tracking  problems  restricts  tiie  references  to  to  a  linear 
combination  of  steps  and  ramps; 

(  r  1  r(()  s:  ry  +  ri(  ,  ro  C  111  ,  r,  C  El  ,  f  iT  0  )  . 

In  order  to  show  the  eliiclency  of  the  jiroposed  siheiiie,  wc  will  focus 
on  actuation  signals  in  Ur  ,  when’  7  G  U,  10].'i  .  Standard  b.mg-bang 
actuation  relying  on  the  rigid-body  approximation  of  I’  for  rcsl-to-rest 


slewing  over  the  interval  11,10]  s  resulted  in  unsatisfactory  outputs, 
Wc  choose  TV  =  20  .  AU  simulations  were  performed  by  sampling  the 
plant  P  at  100  c  .  In  order  to  have  at  least  one  sample  instant  added 
on  for  each  time  step  AT  .  wc  clioose  AT  0.2s  .  We  set  up  the 
linear  programming  problem,  by  constructing  1  (T)  from  the  matching 
conditions  in  Proposition  1  .  For  illustration  purposes,  wc  assume  that 
the  plant  is  initially  at  rest  and  the  output  is  required  to  T-track  steps 
and  ramps  only.  We  solve  46  linear  programs  along  7  =  d  and  another 
40  linear  programs  along  7  =  cj  .  I^t  the  reference  trajectory  r  he 
as  shown  in  Figure  1  (dashed  line).  The  breakpoints  and  slopes  in 
Figure  1  reveal  that  the  sequence  of  inputs  should  satisfy  the  following 
increments  in  the  residue-space  denoted  by  the  ordered  pairs  (ro,r,)  ; 
at  T  =  0  ,  (-4-2,  0) ;  at  T  =  5  ,  (-2,  -1)  ;  at  T  =  12  ,  (-f  7,  -f  1)  .  Using 
the  one-hypcrplanc  approximations  to  the  46  T-polytopcs,  the  fastest 
tracking  times  for  the  sequence  of  residue  mcrcrocnls  aje  3.2 , 4.6  and 
7  s  ,  respectively.  Figure  1  shows  the  input  and  the  associa'-d  output 
of  P  . 
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Figure  1:  An  input  signal  u  (lop)  such  that  /’  ♦  u  (bollom  sobd)  fast 
T-tracks  r  (bollom  dashed) 
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